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NEW POLYPEPTIDES AND THEIR USE FOR THE RESCUE OF FUSION 
DEFECTIVE VIRUS OR RETROVIRUS GLYCOPROTEINS 

The invention relates to new polypeptides, and their use for the rescue of fusion 
defective virus or retrovirus glycoproteins. 

Retroviruses enter cells following their attachment to specific cell surface receptors. 
This function is mediated by the envelope glycoproteins expressed as trimeric complexes on 
the viral particles (Hunter, 1997). Interaction with the receptor is thought to cause a 
conformational change of the glycoprotein structure necessary to expose a "fusion peptide", 
buried within the native envelope complex and involved in the actual membrane fusion 
process (Gerlier, 1998). On the basis of both genetic (Battini et al., 1995; Battini et al., 
1992; Morgan et al., 1993; Ott and Rein, 1992) and structural (Fass et al., 1997; Fass et 
al., 1996) evidence, the attachment and fusion functions of murine leukemia virus (MLV) 
envelopes have been separated on the two subunits of the envelope monomer. Thus it is 
generally considered that the ammo-terminal half of the MLV SU (surface) subunit is 
responsible for the binding to the receptor whereas the ectodomain of the TM 
(transmembrane) subunit contains the "fusion machinery" composed of an animo-terminal 
fusion peptide associated to a coiled-coil structure that rearrange upon receptor interaction. 

How retrovirus envelope glycoproteins convert binding to the receptor to a signs! 
which activates the fusion machinery is currently unknown. Ample evidence from uV 
literature indicate that functional inter-relations between different domains of the envelope 
complex are essential to promote fusion activation (Lavillette et al., 1998; Rein et al., 1998; 
Zhao et al., 1997; Zhao et al., 1998). 

Retroviruses can be involved in two different types of membrane fusion : 

- a fusion between viral and cell membranes triggered by the interaction of the virion 
envelope glycoproteins with specific cell surface molecules, which are the viral receptors 
(virus-cell fusion), with said fusion being responsible for retroviral infection, 

- a fusion between two cell membranes triggered by the interaction of the envelope 
glycoproteins expressed at the surface of a cell and the viral receptors expressed on a 
neighbouring cell (cell-cell fusion), the visible outcome of this latter being the formation of 
cell syncytia containing up to 100 nuclei (also known as polykaryocytes or multinucleated 
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giant cells); syncytium-formation results in the death of the cells which make up the 
syncytium. 

Furthermore, cleavage of the R peptide (inhibitory peptide on the cytoplasmic tail of 
the TM subunit of the envelope glycoprotein) is required to activate fully the fusogenic 
5 potential of these envelope glycoproteins and mutants lacking the R peptide show greatly 
enhanced activity in cell fusion assays (Rein et al, J. Virol., 1994, 68, 1773-1781; Ragheb 
& Anderson, J. Virol., 1994, 68, 3220-3231; Brody et al, J. Virol., 1994, 68, 4620-4627). 

Furthermore, it is known that a vector directing the expression on a eukaryotic cell 
surface of a syncytium-inducing polypeptide can be used in gene therapy of malignant 
10 diseases, as a result of selective elimination of unwanted cells, through syncytia formation 
(WO 98/40492). 

The fusogenicity of envelope glycoproteins has been enhanced as described in 
Lavillette et al. (1998) and internal application WO 99/36561. The proline-rich region 
(PRR) of amphotropic MLV envelope glycoprotein was shown to be an essential 
15 determinant of membrane fusion activation and specific PRR-mutated envelope 
glycoproteins were obtained and were characterised by an increased cell-cell or virus-cell 
fusogenicity. 

Studies on Mo-MLV envelope protein GP70 have shown that the first eight amino 
acids of the mature gp70 protein are essential for fusion and virus entry. Within this region, 
20 a particularly important residue is H8. This residue belongs to the PHQ motif which is 
conserved among all MLVs and some other retroviruses, such as a gibbon ape leukemia 
virus SEATO and feline leukemia virus types A, B, and C. The amphotropic counterpart of 
Del2-8, which has the first six amino acids (Met-Ala-Glu-Ser-Pro-His) deleted has been 
constructed. Vectors bearing the deletion mutant envelope protein could not transduce NIH 
25 3T3 or HeLa cells (Bae et al., Journal of Virology, March 1997, p. 2092-2099). 

As the fusion mechanisms induced by interaction of retrovirus envelope 
glycoproteins and receptors of cell membrane are partially unknown, the regulation of the 
fusogenicity of retroviral or viral envelope glycoproteins was up to day not possible. 

One of the aims of the invention is to enable fusion control of envelope glycoproteins 
30 with target cells. 

It is an aim of the invention to provide means for the rescue of the fusion property of 
fusion defective viral or retroviral envelope glycoproteins. 
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It is another aim of the invention to provide a process for restoring the fusogenic 
property of retroviral or viral envelope glycoproteins, the fusogenic property of which has 
been inactivated and for controlling their activation by external means. 

It is another aim of the invention to provide a process in which envelope 
5 glycoproteins are in a first stage inactivated as to their fusion property and then in a second 
stage activated in a controlled and regulated way, by means of activating polypeptides, 
which possibly present targeting capacities, and which are able to restore efficiently the 
fusogenic and cytotoxic properties of said envelope glycoproteins. 

It is another aim of the invention to provide polypeptides liable to activate the cell- 
10 cell fiisogenicity or the virus-cell fusogenicity. 

It is another aim of the invention to provide polypeptides liable to activate the virus- 
cell fusogenicity and which thus can be used in strategies involving retroviral vectors 
comprising envelope glycoproteins, the fusion property of which can be activated. 

It is another aim of the invention to provide fusogenic envelope glycoproteins 
15 presenting cytotoxic properties for use in the treatment of malignant diseases, for which the 
fusogenic property could be controlled by means of activating polypeptides. 

It is another aim of the invention to provide cytotoxic fusogenic envelope 
glycoproteins having cellular specificity and for which the fusogenic property can be 
controlled by means of activating polypeptides. 
20 All these aspects are achieved through the present invention, which in one of its 

broad embodiments is the use of at least one activating polypeptide comprising an amino 
acid sequence corresponding to : 

- the amino acid sequence of the surface subunit (SU) of an envelope glycoprotein of a first 
virus, or of a first retrovirus, or 

25 - an amino acid sequence derived from said surface subunit, or 

- a fragment of an amino acid sequence of said surface subunit, encompassing the receptor 
binding domain (RBD) and the amino terminal part of the surface subunit (SU), 

for the rescue of the fusion property of a fusion defective envelope glycoprotein, which has 
the property of attachment to target cells and is inactive with respect to the fusion with said 
30 target cells, said fusion defective envelope glycoprotein belonging to a second virus or a 
second retrovirus of the same type as or of a different type from the above designated "first 
virus or first retrovirus" . 
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Thus the invention lies in the possibility of desactivating the inactivation of the 
fusogenic property of an envelope glycoprotein, said envelope glycoprotein having 
previously been made inactive with respect to fusion. This restoration of the fusion property 
of the envelope glycoprotein is unexpectedly achieved by polypeptides whose common 
5 structure is the presence of the receptor binding domain (RBD) and the amino terminal part 
of SU in the environment of cells expressing fusion defective envelope glycoproteins and of 
cells expressing retroviral or viral receptors and the interaction of which will lead to the 
formation of syncytia, or in the environment of retroviruses incorporating fusion defective 
envelope glycoproteins and of cells expressing retroviral or viral receptors and the 

10 interaction of which will lead to infection. 

The expression "polypeptide comprising an amino acid sequence corresponding to", 
means that the polypeptide has the same amino sequence as that of the surface subunit, or as 
that of an aminoacid sequence derived from the surface subunit or of a fragment and can be 
prepared by recombinant technique or by chemical synthesis. 

15 The expression "polypeptide comprising an amino acid sequence derived from said 

surface subunit" means that the polypeptide has an aminoacid sequence which is not strictly 
identical to that of the sequence of the surface subunits and may contain mutations such as 
deletion or substitution of aminoacids of the parental sequence or insertion of aminoacid 
sequence of other origins. 

20 The "amino terminal part of the surface subunit" is defined as the aminoacid 

sequence of the first aminoacids of a length of about 10 to about 15 aminoacids of the 
envelope surface subunit, not including the aminoacid sequence of the signal peptide. 

A fusion defective envelope glycoprotein derives from a parental envelope 
glycoprotein in which the fusogenic property has been inactivated, but which still presents 

25 the property of recognizing (attachment) target cells. 

The expression "inactive with respect to the fusion" means that in a test for the measure of 
the fusogenic activity, the fusion index is either nul or significantly reduced when the fusion 
assay is allowed to proceed for about 20 minutes. 

The rescue of the fusion property can be measured according to one of the following 

30 tests : 

- test of formation of syncytia : expressing vectors encoding the envelope glycoprotein to be 
tested are transfected into cells on day 0. On day 2, the transfected cells are overlaid by 
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indicator cells which express the cell surface receptors corresponding to the tested envelope 
glycoprotein. Detection of syncytia (i.e. fusion of envelope-expressing cells with 
neighbouring indicator cells) is performed at day 3 or day 4 and fusion indexes are 
measured as previously described (Lavillette et al, 1998) by counting the number of 
5 syncytia and the number of nuclei outside and inside the syncytia. 

- test of infection of target cells : retroviral vectors carrying a lac Z transgene are being 
generated with the tested envelope glycoproteins. Serial dilutions of the vectors are applied 
on target cells at day 0. Infected cells are stained at day 2 with X-Gal to reveal the number 
of cell colonies having integrated the lac Z-vectors and titers are determined according to 

10 standard procedures (Lavillette at al., 1998). 

It is to be noted that the activating polypeptide corresponds to the amino acid 
sequence of the SU of an envelope glycoprotein of a first virus or retrovirus and that the 
fusion defective envelope glycoprotein belongs to a second virus or retrovirus of the same 
type or of a type different from the first virus or retrovirus. This means that there are in fact 

15 two possibilities : 

- either the first and second viruses are of the same type, which means that the activating 
polypeptide and the fusion defective envelope glycoprotein are homologous, 

- or the first and second viruses are of a different type, which means that the activating 
polypeptide and the fusion defective envelope glycoprotein are heterologous. 

20 According to an advantageous embodiment the fusion defective envelope 

glycoprotein involved are those derived from retroviruses. 

Retroviral envelope glycoproteins allow receptor binding of the viral particles onto 
which they have been incorporated during virion assembly. This preliminary attachment on 
the viral receptor activates a second function of the retroviral envelope glycoproteins which 

25 induces the fusion between the viral and the target cell membranes. While for type C 
mammalian retroviruses the determinants of receptor binding (RBD, receptor binding 
domain), are located at the ammo-terminal end of the SU envelope subunit, the molecular 
determinants that are responsible of membrane fusion are located in the TM subunit of the 
glycoprotein. The molecular determinants or events which are involved in the conversion of 

30 binding to the receptor into a signal which activates the TM fusion determinants are not 
precisely known although by mutagenesis analysis, they are believed to be present in all the 
SU subunit, in particular in the N-terminal end of the RBD itself (Bae et al., 1997; 



BNSDOCID: <WO 0071578A2J_> 



WO 00/71 578 PCT/EP00/04534 

6 

Lavillette et al., 2000), in the proline-rich region (Lavillette et al., 1998) and in the 
carboxy-terminal domain of the SU (Nussbaum et al., 1993; Pinter et al., 1997). Thus the 
control of the activation of the fusion machinery harbored by the TM subunit might be 
achieved by introducing specific mutations that affect the fusion-activation determinant of 
5 the SU. This invention describes several mutations in the SU which can inhibit or partially 
inhibit the SU determinants which control activation of the TM fusion-machinery. It is 
important to understand that such mutations do not impair the fusogenic potential of the 
mutant envelope glycoproteins, but only impair the molecular determinants that control the 
activation of the fusion machinery. The latter one is still intact, despite the mutations 
10 introduced in the SU, but is not activated upon receptor binding. In addition the invention 
describes the use of soluble polypeptides that can alleviate the effect of these mutations and 
hence couple the fusion machinery to a device that is provided exogenously in substitution 
to the normal fusion control determinants which are disrupted in the mutant envelope 
glycoproteins. Thus the addition of the "activating" polypeptides can restore or rescue the 
15 fusogenicity of the mutant envelope glycoproteins. 

Several mutations or modifications of the envelope glycoprotein can be used to 
reversibly inhibit or reduce their fusogenicity. 

A first method is the introduction of the delH mutation that removes an histidine 
from the PHQ conserved motif located at the ammo-terminal end of the SU envelope 
20 subunit of type C mammalian retroviruses such as MLV-A, MoMLV and GALV (mutations 
H5del, H8del or H9del, respectively). Such a mutation induces a drastic effect in the 
capacity of the receptor binding domain (RBD) of these envelope glycoproteins to triggers 
further events required to promote membrane fusion. 

A second method consists in the replacement of the SU C-terminal domain (C 
25 domain) with an heterologous C domain derived from the envelope glycoprotein of another 
type C mammalian retrovirus. For example; the chimeric CMO amphotropic glycoprotein 
(hereafter defined) can be constructed by combining the amphotropic MLV RBD and PRR 
(proline-rich region) with the ecotropic MoMLV C-terminal domain (example 4, Fig. 29A 
and 29B). For such chimeras, there is a low compatibility between their RBD (amphotropic 
30 origin) and C domain (ecotropic origin), which significantly impairs the kinetic of 
membrane fusion. 
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A third method consists in the replacement of the RBD by a ligand such as a 
cytokine or a single chain antibody targeted to a cell surface specific receptor. In contrast to 
the RBD of the SU of a type C mammalian retrovirus, such ligands do not contain the 
determinants that allow further activation of the Env fusogenicity. Such chimeric envelope 
5 glycoproteins therefore resembles a delH envelope glycoprotein which would carry an RBD 
that is targeted to a specific receptor. 

This invention relates to a method which can alleviate the inhibitory effect of these 
different mutations and thus restore the fusogenic properties of the mutant envelope 
glycoproteins. This method uses preparations of polypeptides which are derived from the 
10 RBD of different type C mammalian retrovirus envelope glycoproteins. For example, the 
addition of an RBD which is compatible with the C domain of a fusion-defective envelope 
glycoprotein will fully restore its fusogenic properties and will activate the kinetic of 
membrane fusion. Similarly such a peptide will also restore the fusogenicity of envelope 
glycoproteins that harbor a delH mutation or, alternatively, which carry a cell-type specific 
15 ligand in place of the RBD. 

This method can be applied to the activation of infectivity of retroviral vectors whose 
envelope glycoproteins carry one of these fusion-defective mutations. Depending on the 
presence of a ligand in the fusion-deficient envelope glycoprotein or, alternatively, in the 
activating polypeptide, control of infection to specific target cell types will therefore be 
20 obtained. The invention can also be applied to the control of the hyper-fusogenic activity of 
retroviral envelope glycoproteins that can induce cell-cell fusion and elimination of 
unwanted cells by syncytia formation. Indeed while most retroviral envelope glycoproteins 
are not truly fusogenic when expressed at the surface of the cells that produce the retroviral 
particles, they acquire their fusion competency during or after assembly and release of the 
25 viral particles. Yet there is considerable interest to convert the retroviral envelope 
glycoproteins into hyperfusogenic glycoproteins that can fuse cells in the close vicinity of 
the cells into which they are expressed. In addition to the introduction of the above- 
mentioned mutations that can control activation of their fusogenicity, the fusogenicity of 
retroviral envelope glycoproteins can be stimulated by several ways to obtain the best effect 
$0 upon activation with an exogenous activating polypeptide. Thus, additional mutations will 
be introduced into the retroviral envelope glycoprotein, such as the truncation of the 
cytoplasmic tail (Rless mutation) and/or the modification of their proline-rich regions. 
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According to an advantageous embodiment, the activating polypeptide used in the 
invention recognizes substantially the same receptors of the target cells as the fusion 
defective envelope glycoprotein. 

According to another advantageous embodiment, the activating polypeptide used in 
5 the invention recognizes receptors of the target cells which are different from the ones 
recognized by the fusion defective envelope glycoprotein. 

It is thus quite unexpected that the activating polypeptides can restore the 
fusogenicity of fusion defective envelope glycoproteins by interacting with retroviral 
receptors which are expressed at the surface of the same target cells, and which are different 
10 from the receptors recognized by the fusion defective envelope glycoprotein. 

According to another advantageous embodiment of the invention, the activating 
polypeptide and/or the fusion defective envelope glycoprotein is (are) linked to a ligand, 
which is specific with respect to some receptors of the target cells, the ligand being 
preferably linked to the activating polypeptide. 
15 The addition of specific ligands on the activating polypeptides and/or on the fusion 

defective envelope glycoprotein enables to control the fusion activity of said fusion defective 
envelope glycoprotein and to increase the specificity with respect to target cells. Therefore, 
the design of a specific system to eliminate unwanted cells, may consist in a fusion defective 
envelope glycoprotein associated to an activating polypeptide that is linked to a ligand that 
20 can specifically recognize a receptor expressed on the target cells, whereas the activating 
polypeptide recognizes the same receptor or a different receptor expressed on the target 
cell. 

The link between the activating polypeptide and the ligand is preferably a covalent 
bond, whereby both the activating polypeptide and the ligand have been genetically fused 
25 and encoded by an expression vector as a single protein chain, and whereby the ligand is 
fused either to the amino-terminus or to the carboxy terminus of the activating polypeptide. 

According to another advantageous embodiment, the ligand is chosen among 
molecules liable to recognize receptors of target cells, for instance receptors of tumoral 
cells, such as EGF, FGF, VEGF, or liable to recognize epitope of proteins expressed by the 
30 target cells, such as recombinant monoclonal antibody. 

According to another advantageous embodiment, the activating polypeptide is under 
the form of a product as such, or is expressed by a host, such as a virus or a cell, or is 
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expressed, in particular by the target cells, transformed by nucleotide sequences coding for 
said activating polypeptide. 

In the use of the invention, one resorts either directly to chemical substances (which 
is the meaning of activating polypeptide under the form of a product as such), or to host 
5 (virus or cell) expressing said activating polypeptide. 

In the following of the text, the expression "soluble activating polypeptides" will be 
sometimes used, meaning that said polypeptides are not integrated in the envelope proteins 
of a virus. 

The capacity of the activating polypeptides to rescue the infectivity of fusion 
10 defective envelope glycoproteins is similar whether the target cells i) constitutively express 
and secrete the activating polypeptides, ii) are co-incubated with both fusion defective 
envelope glycoprotein and said activating polypeptides iii) are pre-incubated with said 
activating polypeptides before adding the fusion defective envelope glycoprotein, or iv) are 
pre-incubated with the fusion defective envelope glycoprotein before adding the activating 
15 polypeptides. 

In an advantageous use of the invention the fusion defective envelope glycoprotein is 
expressed by a host, such as a virus incorporating in its envelope said fusion defective 
envelope glycoprotein, or such as a cell, transformed by nucleotide sequences coding for 
said fusion defective envelope glycoprotein. 
20 According to the invention, the activating polypeptide advantageously corresponds 

to a fragment of about 200 to about 250 amino acids of said surface subunit comprising the 
receptor binding domain (RBD) and comprising the first amino acids of the amino terminal 
part of the surface subunit, the number of said first aminoacids varying from about 10 to 
about 15, in particular comprising the RBD and the PHQ peptide, in particular the 
25 fragments corresponding to amino acid sequences derived from type C mammalian 
retroviruses such as: 
- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 

30 . xenotropic 

. poly tropic 
. 10 Al MLV 
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- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B, 
FELV-C 

in particular wherein the activating polypeptide corresponds to one of the following amino 
acid sequences : 

A-RBD 

E-RBD 

GALV-RBD 

34-1-A-RBD 

34-1-E-RBD 

34-1 -GALV-RBD 

EGF-A-RBD 

EGF-E-RBD 

EGF-GALV-RBD 

- A-RBD corresponds to the activating polypeptide comprising the RBD domain and the 
PHQV motif of the amphotropic MLV surface subunit. 

- E-RBD corresponds to the activating polypeptide comprising the RBD domain and the 
PHQV motif of the ecotropic MLV surface subunit. 

- GALV-RBD corresponds to the activating polypeptide comprising the RBD domain and 
the PHQP motif of the GALV surface subunit. 

- 34.1-A-RBD corresponds to the activating polypeptide A-RBD above defined, linked to a 
ligand which is a single chain antibody directed against MHCI. 

- 34.1-E-RBD corresponds to the activating polypeptide E-RBD above defined, linked to a 
ligand which is a single chain antibody directed against MHCI. 

- 34.1 GALV-RBD corresponds to the activating polypeptide GALV-RBD above defined, 
linked to a ligand which is a single chain antibody directed against MHCI. 

- EGF-A-RBD corresponds to the activating polypeptide A-RBD above defined, linked to a 
ligand which is the epidermal growth factor. 

- EFG-E-RBD corresponds to the activating polypeptide E-RBD above defined, linked to a 
ligand which is the epidermal growth factor. 

- EGF-GALV-RBD corresponds to the activating polypeptide GALV-RBD above defined, 
linked to a ligand which is the epidermal growth factor. 
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According to the invention, the fusion defective envelope glycoprotein 
advantageously corresponds to the amino acid sequence of a mutated form of an envelope 
glycoprotein of a virus or a retrovirus which, in its non mutated form, presents attachment 
and fusion properties with respect to target cells, said fusion defective envelope 

5 glycoprotein comprising one or several mutations in the amino terminal part of its surface 
subunit, particularly in the first amino acids of the amino terminal part of the surface 
subunit, the number of said first aminoacids varying from about 10 to about 15, said 
mutations inactivating the fusion property without substantially affecting the attachment 
property, in particular comprises one or several mutations within the PHQ amino sequence, 

10 wherein in particular the fusion defective envelope glycoprotein is derived from type C 
mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 

15 . xenotropic 

. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B, 
FELV-C, 

20 in particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino acid sequences : 

- MO del H 

- AdelH 

- PROMOdelH 
25 - GALVdelH 

- C2delH 

- MO del H R less 

- AdelHRless 

- PROMOdelHRless 
30 - GALVdelHRless 

- C2delHRless 

- C MO 
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-TM MO 

- C TM MO 

- C MOdelH 

- TM MOdelH 

5 - C TM MOdelh 
-34.1-A 

* MO del H corresponds to the ecotropic MLV envelope glycoprotein, in which the 
histidine residue of the PHQV motif is deleted (i.e. the 8th residue of the SU subunit is 

10 removed). 

* AdelH corresponds to the amphotropic MLV envelope glycoprotein, in which the histidine 
residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* PROMOdelH corresponds to the amphotropic MLV envelope glycoprotein, in which the 
proline rich region has been replaced by the proline rich region of the ecotropic MLV 

15 envelope and in which the histidine residue of the PHQV motif is deleted. 

* GALVdelH corresponds to the GALV envelope glycoprotein, (gibbon ape leukemia virus 
envelope glycoprotein), in which the histidine residue of the PHQP motif is deleted (i.e. the 
9th residue of the SU subunit is removed). 

* C2delH corresponds to the amphotropic MLV envelope glycoprotein, in which the proline 
20 rich region has been modified by a small deletion at its carboxy terminus and in which the 

histidine residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is 
removed). 

* MO del H R less corresponds to the ecotropic MLV envelope glycoprotein, in which the 
histidine residue of the PHQV motif is deleted and in which the R peptide of the TM 

25 subunit has been deleted. 

* AdelHRless corresponds to the amphotropic MLV envelope glycoprotein in which the 
histidine residue of the PHQV motif is deleted and in which the R peptide of the TM 
subunit has been deleted. 

* PROMOdelHRless corresponds to the amphotropic MLV envelope glycoprotein, in which 
30 the proline rich region has been replaced by the proline rich region of the ecotropic MLV 

envelope, in which the histidine residue of the PHQV motif is deleted and in which the R 
peptide of the TM subunit has been deleted. 
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* G A LVdelHRless corresponds to the GALV envelope glycoprotein, in which the histidine 
residue of the PHQP motif is deleted and in which the R peptide of the TM subunit has been 
deleted. 

* C2delHRless corresponds to the amphotropic MLV envelope glycoprotein, in which the 
proline rich region has been modified by a small deletion at its carboxy terminus and in 
which the histidine residue of the PHQV motif is deleted (i.e. the 5th residue of the SU 
subunit is removed) and in which the R peptide of the TM subunit has been deleted. 

* CMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which the 
SU carboxy-terminal domain, C domain, has been replaced by the C domain derived from 
the ecotropic MoMLV envelope glycoprotein. 

* CMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which the SU carboxy-terminal domain, C domain, has been replaced by the C domain 
derived from the ecotropic MoMLV envelope glycoprotein and in which the histidine 
residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* TMMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
the ectodomain of the TM subunit has been replaced by the TM ectodomain derived from 
the ecotropic MoMLV envelope glycoprotein. 

* TMMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which the ectodomain of the TM subunit has been replaced by the TM ectodomain derived 
from the ecotropic MoMLV envelope glycoprotein and in which the histidine residue of the 
PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* CTMMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
both the SU carboxy-terminal domain, C domain, and the ectodomain of the TM subunit 
have been replaced by the C domain and TM ectodomain derived from the ecotropic 
MoMLV envelope glycoprotein. 

* CTMMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which both the SU carboxy-terminal domain, C domain, and the ectodomain of the TM 
subunit have been replaced by the C domain and TM ectodomain derived from the ecotropic 
MoMLV envelope glycoprotein and in which the histidine residue of the PHQV motif is 
deleted (i.e. the 5th residue of the SU subunit is removed). 
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* 34.1- A envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
the receptor binding domain has been removed and replaced by an single chain antibody 
against human class I MHC molecules. 

5 The invention also relates to the use of at least one activating polypeptide and of at 

least one fusion defective envelope glycoprotein for the preparation of a drug useful for the 
treatment of cancer and infectious diseases. 

The invention also relates to products containing an activating polypeptide and a 
fusion defective envelope glycoprotein as defined above, as a combined preparation for 
10 simultaneous, separate, or sequential use for the rescue of the fusion defective property of 
said fusion defective envelope glycoprotein. 

The invention also relates to products containing an activating polypeptide and a 
fusion defective envelope glycoprotein as defined above, as a combined preparation for 
simultaneous, separate or sequential use for the treatment of cancer pathologies. 
15 The invention also relates, as new products, to activating polypeptides corresponding 

to a fragment of about 200 to about 250 amino acids of said surface subunit comprising the 
RBD and the PHQV peptide, in particular the fragments corresponding to amino acid 
sequences derived from type C mammalian retroviruses such as: 
- Murine leukemia virus such as : 
20 . amphotropic 

. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

25 - or GALV type retroviruses and related retrovirus such as SSAV, FELV-B 

in particular wherein the activating polypeptide corresponds to one of the following amino 

acid sequences : 

A-RBD 

E-RBD 
30 GALV-RBD 

34-1-A-RBD 

34-1 -E-RBD 
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34-1-GALV-RBD 
EGF-A-RBD 
EGF-E-RBD 
EGF-GALV-RBD 

5 The invention also relates, as new products, to fusion defective envelope 

glycoprotein to the amino sequence of an envelope glycoprotein of a virus or a retrovirus 
which in its native state presents attachment and fusion properties with respect to target 
cells, and comprises one or several mutations in the amino terminal part of its surface 
subunit, inactivating the fusion property without substantially affecting the attachment 
10 property, in particular comprises one or several mutations within the PHQ amino sequence 
of the amino terminal part of its surface subunit, wherein in particular the fusion defective 
envelope glycoprotein is derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
15 . ecotropic 

. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B or 
20 FELV-C 

in particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino acid sequences : 

- AdelH 

- PROMOdelH 
25 - GALVdelH 

- C2delH 

- AdelHRless 

- PROMOdelHRless 

- GALVdelHRless 
30 - C2delHRless 

-CMO 

- TM MO 
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- C TM MO 

- C MOdelH 

- TM MOdelH 

- C TM MOdelh 
5 -34.1-A 

with the proviso that they are different from the ecotropic MLV sequence, wherein the 
amino acids in position 2 to 8 are either substituted or deleted. 

The invention also relates to the use of at least one activating polypeptide to the 
rescue of the fusion property of a fusion defective envelope glycoprotein wherein the fusion 
10 defective envelope glycoprotein corresponds to the amino acid sequence of a mutated form 
of an envelope glycoprotein of a virus or a retrovirus which, in its non mutated form, 
presents attachment and fusion properties with respect to target cells, said fusion defective 
envelope glycoprotein being advantagously derived from type C mammalian retroviruses 
such as: 

15 - Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 
. polytropic 

20 , 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, PELV- 
IC, FELV-C and comprising : 

* one or several mutations in the amino terminal part of its surface subunit, 
25 particulary in the first amino acids of the amino terminal part of the surface subunit, the 
number of said first aminoacids varying from about 10 to about 15, said mutations 
inactivating the fusion property without substantially affecting the attachment property, in 
particular comprises one or several mutations within the PHQ amino-terminal sequence, 
wherein in particular the fusion defective envelope glycoprotein corresponds to one of the 
30 following amino acid sequences : 

- MO R less del H 

- MO del H 
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- A R less del H 

- AdelH 

- PROMO R less del H 

- PROMOdelH 

5 - GALV R less del H 

- GALVdelH 

- C2 R less del H 

- C2delH 

- C MOdelH 
10 - TM MOdelH 

- C TM MOdelh 

* or the replacement of the ammo-teniiinal domain of its surface subunit (RBD) by a 
ligand such as a growth factor, or a cytokine or a single chain antibody, the extent of said 

15 mutations inactivating the fusion property and the attachement property to the normal viral 
receptor, in particular wherein the fusion defective envelope glycoprotein corresponds to the 
following amino-acid sequence : 
-34.1-A 

* or the replacement of its SU carboxy-terminal domain and / or its TM ectodomain, 
and preferably its SU carboxy-terminal domain, by the corresponding SU carboxy-terminal 
domain and / or the TM ectodomain of another "second" glycoprotein derived from type C 
mammalian retroviruses, with the proviso that said fusion defective glycoprotein and said 
second glycoprotein are not derived from the same type C mammalian retroviruses, in 
particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino-acid sequences : 

- CMO 

- TM MO 

- C TM MO 



20 



25 



30 
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FIGURE LEGENDS 



Fig 1A, IB and 1C represent titres of AdelH retroviruses in the presence of soluble 
envelope fragments of the invention. Retroviral vectors carrying VSV-G or/and wild-type 
5 amphotropic, A, envelope glycoproteins were used as controls. Titres expressed as lacZ 
infectious units (i.u.) per ml of viral supernatants. In figures 1A, IB and 1C, the titres are 
on the y-axis. 

Fig 1A. Different target cell types (XC, NIH3T3, CHO-PIT-2, TE 671, from left to right) 
were incubated with both lacZ retroviruses and conditioned media containing, or not 
10 (control), the indicated polypeptides during the 3 hrs of infection. The black columns 
correspond to the control, the grey columns correspond to A-RBD and the hatched columns 
correspond to SU polypeptide. 

Fig IB. XC target cells were incubated with dilutions of A-RBD (x axis) and infected with 
LacZ retroviruses carrying the AdelH envelope glycoproteins (curve with black circles), A 
15 envelope glycoproteins (curve with black squares) or VSV-G envelope glycoproteins (curve 
with hollow squares). 

Fig 1C. Retroviral vectors carrying A envelope glycoproteins (curves with black squares) 
or AdelH envelope glycoproteins (curves with black circles) were incubated with XC target 
cells for 1 hr at 4°C to allow virion binding while preventing cell entry. After PBS washing 

20 to remove unbound retroviruses, cells were incubated at 37°C with undiluted A-RBD- 
containing supernatants for the indicated times of dilution represented on the x axis, the 
titres being expressed on the y axis. A-RBD was then eliminated from the cell supernatant 
by washing the cells four times with 1 ml PBS (resulting in dilution of unbound A-RBD by 
more than 100,000 times). Cells were then grown in regular medium for 2 days before X- 

25 Gal staining. 

Fig 2. represents the titers of A-RBD-mediated activation of AdelH retroviruses. A- 
RBD (used undiluted) was bound to XC target cells at 37 °C for 30 min. Unbound A-RBD 
was removed and cells were washed two times with PBS and incubated at 37°C. At the 
30 indicated times, cells were further washed two times with PBS and incubated at 37 °C for 3 
hrs with retroviruses carrying A envelope glycoproteins or AdelH envelope glycoproteins. 
Titers, determined 2 days later by X-Gal staining, are expressed as lacZ i.u./ml (y-axis), as 
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a function of time before infection (x-axis). The curve with black circles corresponds to 
retrovirus carrying envelope glycoproteins and the curve with black squares corresponds 
to retrovirus carrying AdelH envelope glycoproteins. 

5 Fig 3. represents titers of AdelH retroviruses after ultrafiltration. Titers expressed on 
the y-axis (as lacZ i.u./rnl) of retroviruses carrying wild-type, A, or AdelH envelope 
glycoproteins before (1) or after incubation with A-RBD (2), and following A-RBD elution 
on two successive 700 KD-cut off ultrafiltration cartridges (3). After elution AdelH 
retroviruses were subjected to re-stimulation with fresh A-RBD (4). 

10 

Fig. 4 A and 4B represent infection assays of AdelH and MOdelH retroviruses in the 
presence of RBD or RBDdelH polypeptides. 

Fig. 4A. Retroviruses carrying A (upper left) or AdelH (upper right) envelope 
glycoproteins were mixed with A-RBD (hatched column) or A-RBDdelH (grey column) 
15 during infection of XC cells. Retroviruses carrying MO (lower left) or MOdelH (lower 
right) envelope glycoproteins were mixed with E-RBD (hatched column) or E-RBDdelH 
(grey column) during infection of XC cells. Titers are expressed as lacZ i.u./ml on the y- 
axis. In Figure 4A, the black column corresponds to control. 

Fig. 4B. represents binding assays of A-RBD (solid lines) and A-RBDdelH fragments 
20 (broken lines) on Cearl3 (top), and binding assays of E-RBD (solid lines) and E-RBDdelH 
fragments (broken lines) on Cearl3 cells (bottom). The background of fluorescence (filled 
histograms) was provided by incubating the cells with supernatants devoid of envelope 
fragments. 

Fig 5 A and 5B represent the cross-activation of AdelH and MOdelH retroviruses by 
25 MLVRBDs. 

Fig. 5 A. Titers of retroviruses carrying AdelH or MOdelH envelope glycoproteins in the 
presence of the indicated RBDs. Titers are determined (y-axis) on XC target cells as lacZ 
i.u./ml. On figure 5A, the black column corresponds to control, the hatched column 
corresponds to A-RBD and the grey column corresponds to E-RBD. 
30 Fig. 5B. Titers of AdelH retroviruses (y-axis) on CHO cells expressing, or not (control), 
PiT-2 and/or mCAT-1 MLV receptors in the presence of the indicated RBDs. On figure 5B, 
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the hatched column corresponds to E-RBD, the grey column corresponds to A-RBD and the 

black column corresponds to control. 

Fig. 6 represents the A-RBD sequence 

Fig. 7 represents the E-RBD sequence 
5 Fig. 8 represents the GALV-RBD sequence 

Fig. 9 represents the EGF-A-RBD sequence 

Fig. 10 represents the EGF-E-RBD sequence 

Fig. 11 represents the EGF-GALV-RBD sequence 

Fig. 12 represents the 34.1 -A-RBD sequence 
10 Fig. 13 represents the 34.1 -E-RBD sequence 

Fig. 14 represents the 34.1 -GALV-RBD sequence 

Fig. 15 represents the AdelH sequence 

Fig. 16 represents the PROMOdelH sequence 

Fig. 17 represents the GALVdelH sequence 
15 Fig. 18 represents the C2delH sequence 

Fig. 19 represents the AdelHRless sequence 

Fig. 20 represents the PROMOdelHRless sequence 

Fig. 21 represents the GALVdelHRless sequence 

Fig. 22 represents the C2delHRless sequence 
20 Fig. 23 represents the C MO sequence 

Fig. 24 represents the C MO delH sequence 

Fig. 25 represents the TM MO sequence 

Fig. 26 represents the TM MOdelH sequence 

Fig. 27 represents the C TM MO sequence 
25 Fig. 28 represents the C TM MO delH sequence 

Fig. 29 A represents the schematic structure of chimeric amphotropic envelope glycoproteins 

in which the C domain (CMO chimera), the TM ectodomain (TMMO chimera) or both C 

and TM domains (CTMMO chimera) are replaced by the ecotropic counterpart derived 

from MoMLV envelope glycoprotein. The CMOdeiH, TMMOdelH and CTMMOdelH 
30 chimeric envelope glycoproteins harbor the H5del mutation. 
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Figure 29B represents the kinetic of infection of retroviral vectors carrying the wild-type 
amphotropic envelope glycoprotein (A) or the CTMMO amphotropic chimera on XC cells 
in the presence, or not, of soluble polypeptides encompassing the amphotropic RBD (A- 
RBD) or the ecotropic RBD (E-RBD), as indicated. 

5 The curve with points and black circles corresponds CTMMO on XC cells + E-RBD 
The curve with points and hollow circles corresponds, to A on XC cells + E-RBD 
The curve with continuous line and hollow circles corresponds to A on XC cells 
The curve with dotted line and hollow circles corresponds to A on XC cells + A-RBD 
The curve with continuous line and black circles corresponds to CTMMO on XC cells 

10 The curve with dotted line and black circles corresponds to CTMMO on XC cells + A- 
RBD 

The X axis corresponds to the time expressed in seconds. 
The Y axis corresponds to the number of infected cells. 
Fig. 30 represents the 34.1-A Sequence 

15 

Example 1: 

MATERIALS AND METHODS 

Cell lines. The TELCeB6 cell line (Cosset et al., 1995b) has been derived from the TELac2 
20 line after transfection and clonal selection of a Moloney murine leukemia virus (MoMLV)- 
based expression plasmid to produce Gag and Pol proteins. The TELac2 cells were 
originally derived from the TE671 human rhabdomyosarcoma cells (ATCC CRL8805) to 
express the nlsLacZ reporter retroviral vector (Takeuchi et al., 1994). Production of 
infectious retroviral particles by TELCeB6 cells depends on newly introduced envelope 
25 expression vectors. 

Cerd9 and Cearl3 cells (Kozak et al., 1995) and CHO-PiT-2 cells (Rodrigues and Heard, 
1999) were derived from CHO (Chinese hamster ovary) cells (ATCC CCL-61) and express 
ecotropic MLV receptors alone or both ecotropic and amphotropic receptors or amphotropic 
MLV receptors alone, respectively. 
30 XC-A-RBD cells were derived from XC rat sarcoma cells (ATCC CCL-165) after 
transfection with the pA-ST plasmid expressing the amino terminal receptor binding domain 
of the amphotropic envelope glycoprotein (Battini et al., 1996). 
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TE671 and TELCeB6 cells were grown in DMEM (Life-Technologies) supplemented with 
10% fetal bovine serum. NIH-3T3 mouse fibroblasts were grown in DMEM supplemented 
with 10% new-born calf serum. Cerd9, Cearl3, CHO-PiT-2 and CHO cells were grown in 
DMEM supplemented with 10% fetal bovine serum and with proline (Life-Technologies). 

5 

Construction of envelope expression vectors. Plasmids FBASALF and FBMOSALF 
carrying a phleomycin-resistance gene and encoding the MLV-4070A amphotropic (noted as 
A) and MoMLV ecotropic (noted as MO) envelope glycoproteins, respectively, have been 
described elsewhere (Cosset et al., 1995a) and were used as backbones for construction and 

10 expression of envelope mutants. All constructs were generated by PCR-mediated and 
oligonucleotide site-directed mutagenesis (details and sequences available upon request). 
The FBASALF plasmid was modified to produce a cell entry-defective form of the 
amphotropic glycoprotein, designated AdelH envelope, by deleting the 36th codon of the 
4070A env gene (Ott et al., 1990). The resulting mutant envelope glycoprotein, in which 

15 the 5th residue of the SU envelope subunit was removed, was named AdelH. The 
expression plasmid encoding the fusion-defective MOdelH envelope glycoproteins, 
harboring a deletion of the 8th residue of the SU envelope subunit corresponding to the 41th 
codon of the MoMLV env gene (Shinnick et al., 1981), was derived from FBMOSALF. 
Plasmids encoding soluble receptor binding domains (RBD) were derived from FBASALF 

20 and FBMOSALF expression vectors. The boundaries of either amphotropic (A-RBD) and 
ecotropic (E-RBD) RBDs, defined as A32244 and A34269, were fused in frame with the 
carboxy-terminal ends of either envelope glycoproteins by fusion to aminoacids R623 or 
R634, respectively . Residues are numbered starting from the initiation methionine deduced 
from the aminoacid sequences of the 4070A amphotropic MLV (Ott et al., 1990) or the 

25 Moloney MLV (Shinnick et al., 1981). Expression vectors encoding either A-RBDdelH or 
E-RBDdelH were generated similarly by using the plasmids expressing the AdelH or 
MOdelH envelope glycoproteins. In plasmids encoding A-RBD, E-RBD, A-RBDdelH or E- 
RBDdelH polypeptides, the 11 aminoacid-long VSV tag (YTDIEMNRLGK) (Kreis and 
Lodish, 1986) was inserted after either RBDs. 

30 

Transfections, production of RBDs and infection assays. Envelope glycoproteins 
expression plasmids were transfected into TELCeB6 as reported elsewhere (Cosset et al., 
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1995a). RED expression vectors were transfected by calcium phosphate precipitation in 
NIH-3T3, XC or TE671 cells. Transfected cells were selected with phleomycin (50 |ag/ml) 
and phleomycin-resistant colonies were pooled. 

RBD-containing supernatants were collected after an overnight production from confluent 
5 RBD-transfected cells, filtrated trough 0.45 micron pore-sized membranes and stored at 
4°C. Virus-containing supernatants were collected after an overnight production from 
freshly confluent env-transfected TELCeB6 cells in regular medium. 

The supernatant were filtered trough 0.45 micron pore-sized membranes and diluted in 
DMEM for titration assays. Target cells were seeded in 24 multi-well plates at a density of 

10 5xl0 4 cells per well and incubated overnight at 37°C. Unless otherwise indicated in figure 
legends, 200 \xl of conditioned cell culture media containing the RBDs were added to the 
cells after removing their supernatants. Then 200 p,l of viral supernatant dilutions containing 
5 p.g/ml polybrene were added and cells were incubated for 3 hrs at 37°C. Cell supernatants 
were then removed and cells were incubated in regular medium for 48 hrs. X-Gal staining 

15 and viral titer determination were performed as previously described (Cosset et al., 1995a) 
and expressed as lacZ i.u. (infectious units)/ml. 

Antibodies. Anti-SU: a rat monoclonal antibody 83A25 (Evans et al., 1990) cell culture 
supernatant against MLV SU used undiluted for FACS analysis. Anti-VSV tag: a purified 
20 mouse monoclonal antibody P5D4 (SIGMA-Aldrich) used diluted 1/100 for FACS analysis. 

Binding assays and FACS analysis. For binding assays, target cells were washed in PBS 
and detached by a 10 min incubation at 37°C with versene 0.02% in PBS. Cells were 
washed in PBA (PBS with 2% FCS and 0.1% sodium azide). 5xl0 5 cells were incubated 

25 with 1 ml of conditioned supernatants containing the VSV tagged RBDs for 45 min at 37°C. 
Cells were then washed with PBA and were incubated with the P5D4 antibody, for 45 min 
at 4°C. Cells were washed twice with PBA and incubated with anti-mouse FITC-conjugated 
antibodies (DAKO, U.K.), respectively. 5 min before the two final washes in PBA, cells 
were counter-stained with 20 |ag/ml propidium iodide. Fluorescence of living cells was 

30 analysed with a fluorescent-activated cell sorter (FACSCalibur, Beckton Dickinson). 
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RESULTS 

HSdel phenotype can be efficiently compensated by soluble A-RBD. Viral particles were 
generated with either wild-type amphotropic A or AdelH envelope glycoproteins and used to 
infect target pells in the presence or in the absence of soluble SU or soluble A-RBD, an 
5 envelope fragment encompassing the amphotropic receptor binding domain. When SU or A- 
RBD were provided during infection the infectivity of retroviruses carrying wild-type 
amphotropic envelope glycoproteins was decreased by approximately 5-100 times (Fig. 1A), 
most likely owing to the partial blocking of PiT-2 receptors on the target cell surface. In 
striking contrast, while in the absence of envelope fragments retroviruses generated with 

10 AdelH envelopes were very poorly infectious, with titers in the range of lOMO 2 lacZ i.u. 
(infectious units)/ml, the presence of SU or of A-RBD in the medium of infected cells could 
strongly stimulate the infectivity of the former retroviruses by up to 30,000 fold (Fig. 1A). 
Stimulation of AdelH infection with either soluble envelope fragments was detected on a 
wide range of target cell types including rat XC cells, 3T3 murine fibroblasts, TE671 

15 human cells and PiT-2-transfected CHO hamster cells, though with different efficacies (Fig. 
1A). Significant differences were detected when soluble SU or A-RBD were used to activate 
the AdelH retroviruses. Comparatively to soluble SU, A-RBD was 100 to 1,000 fold more 
potent to activate AdelH retroviruses (Fig. 1A). These results indicated that the 
determinants of activation were located in the first half of the SU protein. Therefore to 

20 facilitate the characterization of this activation pathway, the subsequent experiments were 
performed by using the soluble RBD. A-RBD could fully activate AdelH retroviruses, when 
diluted 10 fold and retained 90% of activity at a 1:100 dilution, thus suggesting that it was 
effective at very low doses (Fig. IB). In contrast the infectivity of control retroviruses 
pseudotyped with either wild-type amphotropic envelopes or with VSV-G glycoproteins was 

25 weakly increased or unchanged, respectively, when A-RBD was diluted (Fig. IB). 
Additionally the origin of the cell types used to produce the envelope fragments did not 
influence the stimulation of infectivity since conditioned media harvested from 3T3, TE671 
or XC cells that were engineered to express the envelope fragments could similarly activate 
the infectivity of AdelH retroviruses (data not shown). Finally activation of AdelH 

30 retroviruses was found to be the result of a specific interaction of A-RBD with the PiT-2 
amphotropic receptor. Indeed while A-RBD could efficiently rescue the infectivity of AdelH 
retroviruses on PiT-2-transfected CHO cells, no infection could be detected when using 
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parental CHO cells as target cells (see below and Fig. 5A). Altogether these data 
demonstrated that activation of AdelH retroviruses by A-RBD was specific. 
The capacity of the soluble envelope fragments to rescue the infectivity of AdelH 
retroviruses was similar whether the target cells i) constitutively expressed and secreted the 
envelope fragments, ii) were co-incubated with both AdelH retroviruses and A-RBD, iii) 
were pre-incubated with A-RBD prior to infection, . or iv) were pre-incubated with the 
AdelH retroviruses before adding A-RBD (Table 1). Using this latter experimental 
condition, the kinetic of activation of infection was found very rapid. Indeed a brief 
incubation of target cells with A-RBD, by no more than 30 sec, was found sufficient to fully 
0 stimulate the infectivity of pre-bound AdelH retroviruses (Fig. 1C). 

A-RBD stimulates the target cell membrane rather than the AdelH envelope. In theory, 
it is possible that the envelope fragments might rescue the infectivity of AdelH retroviruses 
either by interacting with the fusion-defective viral particles or by activating the target cell 
membrane upon receptor binding. Indeed within the envelope complex, the SU subunits are 
15 not tightly attached to either the TM subunits or to the other SU units and can easily shed 
off the viral particles (Hunter and Swanstrom, 1990). However the reassociation of shed SU 
with the viral particle is unlikely since they are incorporated on virions by virtue of then- 
association with the TM subunit that results from their synthesis as a common SU-TM 
polypeptide precursor (Hunter and Swanstrom, 1990). Therefore it is unlikely that soluble 
20 SU or A-RBD mixed with AdelH retroviruses may interact with the envelope complex of 
the latter retroviruses and thus rescue their fusion defect. Nevertheless the following 
experiments were performed to address this possibility. 

A-RBD was bound to target cells at 37°C for 30 min. After removal of unbound fragments 
by washing target cells, binding of A-RBD was verified by FACS analysis. Cells were then 

25 incubated at 37°C for 0 (TO) to 28 hrs (T28) to allow internalisation of receptor/A-RBD 
complexes (Rodrigues and Heard, 1999) and A-RBD disappearance from the cell surface, as 
checked by FACS analysis (Data not shown). Cells were then washed again to remove A- 
RBD that may have been released by the cells and the AdelH retroviruses were then added 
at the different time points. Comparatively to infection performed at TO, no significant 

30 decrease of infectivity could be detected when the AdelH retroviruses were added until 8 
hrs after the initial binding with A-RBD. Likewise the infectivity of AdelH retroviruses 
added at T20 was at least 50 fold higher than before pre-stimulation of the cells with A- 
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RBD, thus indicating that stimulation of infectivity was durable (Fig. 2). Since A-RBD was 
efficiently internalised and removed from the cell surface after 4 hrs of incubation (Data not 
shown), these results indicated that activation of AdelH retroviruses most likely occurred by 
activation of the cell surface rather than by interaction with the AdelH envelope complex 
5 itself. 

Further evidence indicated that A-RBD could not associate with the AdelH retroviruses. 
Indeed retroviruses generated with either A or AdelH envelopes and mixed or co-expressed 
with A-RBD were not found to incorporate A-RBD as shown by western-blotting of purified 
viral particles with antibodies recognising A-RBD (data not shown). However these 

10 experiments could not exclude that undetected quantities of A-RBD might be associated with 
the virions. Therefore to rule out this possibility, a mixture of A-RBD and retroviruses 
carrying either AdelH or wild-type A envelope glycoproteins were separated on two 
consecutive 700 KD-pore sized ultrafiltration columns in order to elute out A-RBD. No 
decrease of infectivity could be detected for retroviruses carrying wild-type amphotropic 

15 envelopes after the two subsequent ultrafiltrations (Fig. 3), demonstrating that this process 
did not affect the viability of the retroviruses. However while before ultrafiltration the 
AdelH viral particles mixed with A-RBD could efficiently infect XC target cells, as 
expected, only a residual infectivity could be detected after A-RBD elution (Fig. 3), 
indicating that stimulation of AdelH retroviruses was lost after separation from A-RBD. As 

20 control, the AdelH retroviruses processed through the two columns retained their full 
capacity to be stimulated by newly added A-RBD (Fig. 3). Similar experiments were 
conducted after separation of AdelH retroviruses from A-RBD by sephacryl 
chromatography and led to the same conclusions (data not shown). 

Altogether these data indicated that stimulation of infectivity of the AdelH retroviruses 
25 proceeded via activation of the target cell membrane, most likely as a consequence of 
interaction of A-RBD with the PiT-2 amphotropic receptor, rather than via interaction with 
the AdelH envelope incorporated on the retrovirus itself. Thus A-RBD could activate the 
target cell membrane which in turn became competent to allow entry of retroviruses 
carrying the fusion-defective AdelH envelope glycoproteins. 
30 The SU amino-terminal end activates a cell entry pathway common to type C 
mammalian retroviruses. The histidine residue deleted in the H5del mutant glycoprotein 
belongs to a peptide motif, PHQ, found at the amino-termini of the SUs of all type C 
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mammalian retroviruses, suggesting a conserved function of this motif. Thus the H8del 
mutant ecotropic envelope glycoprotein (MOdelH), harboring the deletion of the histidine in 
the PHQV peptide motif of the MoMLV SU 3 is impaired for both cell-cell and virus-cell 
fusion (Bae et al., 1997). Similarly the H9del mutation of the GALV envelope glycoprotein 
5 GAdelH mutant results in impairement of both cell-cell and virus-cell fusion. Further 
experiments were performed to demonstrate that MOdelH, AdelH and GAdelH (delH) 
envelopes were phenotypically similar. Indeed the E-RBD fragment, containing the 
MoMLV ecotropic receptor binding domain, could rescue the infectivity of MOdelH 
retroviruses by up to 1,000 fold (Fig. 4A). Similar results were obtained for GAdelH 

10 retrovirus, whose fusogenicity could be efficiently rescued by soluble envelope fragment 
encompassing the GALV envelope glycoprotein receptor-binding domain. This result 
indicated that fusion activation by soluble envelope fragments was not particular to AdelH 
envelopes and suggested that the MLV SU amino-terminus may convey a signal common to 
other type C mammalian retrovirus which may be required to achieve early post-binding 

15 events. 

To address this hypothesis the delH mutations, H5del, H8del, were introduced in the A- 
RBD or in the E-RBD fragments, respectively. The resultant A-RBDdelH and E-RBDdelH 
could bind their respective cell surface receptors as efficiently as the parental envelope 
fragments (Fig. 4B). They could also decrease the infectivity of retroviruses bearing wild- 

20 type amphotropic or ecotropic glycoproteins, respectively, by up to 10-fold (Fig. 4A). This 
effect was most likely a consequence of partial receptor blocking. However in contrast to A- 
RBD or to E-RBD, neither the A-RBDdelH nor the E-RBDdelH fragments could rescue the 
infectivity of AdelH or MOdelH retroviruses, respectively (Fig. 4A). These data therefore 
demonstrated that the integrity of the amino-terminal ends of either RBD fragments was 

25 absolutely required to activate postbinding functions and suggested that an essential 
determinant of cell membrane activation may reside at the amino-terminal end of the MLV 
RBD. Thus the so-called "receptor binding domain" located in the amino-terminal half of 
the MLV SU may in fact be composed of two different entities, one involved in specific 
receptor binding and a second, likely to be non functional in AdelH or MOdelH envelopes, 

30 involved in transmission of a signal that may activate cell entry upon receptor binding. 

Therefore to test whether the binding function and the activating function of the MLV 
receptor binding domain could be uncoupled, cells "infected" with AdelH or MOdelH 
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retroviruses were cross-incubated with either E-RBD or A-RBD polypeptides, respectively. 
Interestingly the infectivity of MOdelH retroviruses could be activated by either A-RBD or 
E-RBD fragments (Fig. 5 A). This cross -activation was reciprocal since either fragments 
could activate, with similar efficacies, the infectivity of AdelH retroviruses (Fig. 5A). The 
5 cross-reactivity of either RBDs was dependent on the presence of both mCAT-1 and PiT-2 
receptors on the cell surface since neither PiT-2-transfected CHO cells nor mCAT-1- 
transfected CHO cells could be infected when E-RBD was used to activate the infectivity of 
AdelH retroviruses (Fig. 5B) and vice-versa (data not shown). The specificity of activation 
of AdelH/MOdelH retroviruses by ecotropic envelope fragments was further demonstrated 

10 by using the MOD84K-ST envelope fragment, harboring the D84K point mutation that 
inactivates MoMLV envelope binding (MacKrell et al., 1996). This fragment could activate 
neither AdelH nor MOdelH retroviruses despite the presence of both amphotropic and 
ecotropic receptors on target cells (Fig. 5A). Cross-activation of AdelH retroviruses could 
also be obtained by using soluble envelope glycoproteins from Gibbon Ape leukemia virus 

15 (data not shown), thus indicating that infection by delH retroviruses could be activated by a 
pathway common to type C mammalian retrovirus. 

Table 1. Activation of AdelH retroviruses by A-RBD 



20 Cells Pre-binding a binding and infection 13 Titre c 



XC - lacZ(AdelH) 5x10* 

XC - lacZ(AdelH) d + A-RBD d 3.1xl0 5 

XC-A-RBD e - lacZ(AdelH) 9xl0 4 

25 XC A-RBD lacZ(AdelH) 1.1x10 s 

XC lacZ(AdelH) A-RBD 2.7X10 5 



a: Target cells were pre-incubated for one hr at 4°C with conditioned supernatants 
containing A-RBD (used undiluted) or with lacZ retroviral vectors carrying AdelH envelope 
30 glycoproteins, as indicated. 

b: Target cells were incubated for 3 hrs at 37 °C. 

c: Titres as lacZ i.u./ml determined 48 hrs post-infection. 
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d: Both A-RBD (used undiluted) and AdelH retroviruses were added at the same time, 
e: XC cells constitutively expressing A-RBD polypeptides. 

Example 2: Control of syncytia formation by RBD polypeptides 

5 

Ideally, the transfer of therapeutic genes into tumour cells should lead both to direct 
local cell destruction and to activation of anti-tumour immunity to clear tumour deposits to 
which the genes cannot be delivered. The most commonly used genes for the control of 
local tumour growth have been the suicide gene/prodrug systems such as Herpes Simplex 

10 Virus thymidine kinase (HSVtk)/Ganciclovir (GCV) or cytosine deaminase (CD)/5FC 
systems. Importantly, these systems not only kill gene-transduced target cells but also close 
neighbours via local bystander effects. These effects depend upon cell-cell communication 
or on the diffusion of toxic metabolites, but generally act only over short ranges. 
Nonetheless, the existence of such bystander effects are crucial to compensate for the 

15 relatively poor efficiencies of gene transfer that are currently possible. However, results 
from early clinical trials have demonstrated that both better delivery systems and more 
effective genes are clearly still required to improve the efficiency of both local tumour 
control and of anti tumour immune responses. Hence, therapeutic genes with greater local 
killing activity would represent a significant advance for gene transfer strategies for cancer. 

20 It has been recently demonstrated that the well-known ability of several viral 

fusogenic membrane glycoproteins (FMG) to fuse target cells could be exploited 
therapeutically to kill tumour cells. These previous results introduced the FMG as a novel 
class of therapeutic genes which are more potent than existing suicide gene systems in 
killing tumour cells and do so by mechanisms which are highly immunostimulatory. Table 2 

25 shows the results of four FMGs derived from the non fusogenic ecotropic or amphotropic 
murine leukemia virus glycoproteins, whose fusogenicity were increased by deleting the 
p2R peptide (naturally removed during retrovirus egress) for the ARless and MoRless FMG 
and also by modifying the proline-riche region (Lavillette et al., 1998) for the C2Rless and 
PROMORless FMGs. 

30 The four resulting FMGs, Arless, C2Rless and PROMORless and MORless induced 
syncytia formation in most of the cell types that express amphotropic receptors and/or 
ecotropic receptors. However for selective eliminations of cells as required in cancer gene 
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therapy protocols, the design of a mechanism that can control the formation of syncyia is 
required. We therefore introduced in all four FMGs the H5del or the H8del mutation (delH 
mutation) which removes the 5th or the 8 th residue of the amphotropic or the ecotropic SU, 
respectively, an histidine belonging to the PHQV conserved motif. This mutation could 
5 inactivate an early event of the fusion process after receptor binding (Example 1 and 
Lavillette et al., 2000). As previously demonstrated the virus-cell fusion activity of delH 
MLV envelopes bearing the delH mutation can be efficiently rescued by using polypeptides 
encompassing the first 213 amino-acids of amphotropic MLV SU (A-RBD) or the first 236 
amino-acids of ecotropic MoMLV SU (E-RBD) (example 1 and Lavillette et al., 2000). 

10 Consistently with Example 1, the data in Table 2 demonstrated that the cell-cell fusion 
activity of the four FMGs engineered to harbor the delH mutation, AdelRless, 
C2delHRless, and PROMOdelHRless and MOdelHRless, could be rescued by providing 
either the A-RBD or E-RBD polypeptides thus leading to syncytia formation and elimination 
of target cells that express the relevant viral receptors. Interestingly formation of syncytia 

15 by delH FMGs could be achieved by using an heterologous polypeptide (E-RBD) that 
interacted with a receptor different of that recognised by the delH FMGs. 

Table 2. Modulation of cell-cell fusion by soluble envelope fragments 



25 



30 



35 



Envelope RBD 


Syncytia 


delH envelope 


RBD 


Syncytia 


A 










ARless 


+ + 


AdelHRJess 












A-RBD 


+/- 








E-RBD 


+ 


C2Rless 


+ + + 


C2delHRless 












A-RBD 


+ /- 








E-RBD 


+ + 


PROMORless - 


+ + + + 


PROMOdelHRless 




+/- 








A-RBD 


+ + 








E-RBD 


+ + + + 


MORless 


+ + + + 


MOdelHRless 












A-RBD 


+ + + + 








E-RBD 


+ + + + 



fusion indexes (calculated according to Lavillette et al., 1998) -: less than 100, +/-: 100-304, 
+ : 300-1000, + + : 1000-10 000, + + + : 10000-15000, + + + + : > 15000 
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Of note, similar results of fusion activation with RBD polypeptides were obtained 
with the GALVdelHRless envelope glycoprotein, indicating that the delH mutation of the 
potent GALVRless FMG (Bateman et al., 2000) can be controlled by adding a soluble Env- 
5 derived polypeptide (Lavillette and Cosset, unpublished results). These data indicate that it 
is possible to exploit a well-known phenomenon - the fusion of virally infected cells with 
uninfected cells as a novel, potent cytotoxic therapy using gene transfer. There are several 
options to exploit the use of delH FMGs in cell elimination stategies. Firstly viral vectors 
such as adenovirus vectors can be used to transduce both the delH FMG and the gene 

10 encoding the activating RBD polypeptide in cancer cells in vivo. Secondly cells transduced 
in vitro or ex vivo by a vector expressing a delH fusion-defective FMG can be induced to 
form syncytia after inoculation of an RBD presented as a protein preparation. In addition as 
well as using direct gene transfer, delH FMG and RBD can be effectively delivered through 
cell-mediated mechanisms, which will also be useful as an option for in vivo delivery of 

15 FMG into tumours for both local tumour eradication as well as enhancement of anti tumour 
vaccination strategies. 

Example 3: Addition of iigands on RBD polypeptides 

20 To further control the fusion activation of FMGs by RBD polypeptides, a ligand was 

inserted at the amino-terminal end of the RBD. The anti-MHC-I 34.1 scFv was chosen as 
we previously demonstrated that it could redirect the host-range of retroviral vectors 
carrying 34.1-ecotropic MLV chimeric envelope glycoproteins and allow specific infection 
of human cells (Marin et al. 1996). Therefore the 34.1 scFV was fused to A-RBD or to E- 

25 RBD and preparation of 34.1-A-RBD or 34.1-E-RBD polypeptides were obtained. 
Retroviral vectors carrying wild-type envelope glycoproteins, A, or AdelH fusion-defective 
amphotropic envelope glycoproteins were used to infect target cells in the presence of A- 
RBD, E-RBD, 34.1-E-RBD or 34.1-A-RBD polypeptides (Table 3). While either 
polypeptides could only weakly decrease the infectivity of parental retroviruses, they could 

30 strongly stimulate the infectivity of AdelH retrovirus. No significant differences could be 
detected when comparing the A-RBD, the E-RBD, the 34.1-E-RBD or the 34.1-A-RBD 
polypeptides in activation of AdelH retrovirus infectivity (Table 3). 



BNSDOCID: <WO 0071578A2_I_> 



WO 00/7 1 578 PCT/EP00/04534 

32 



Table 3. Modulation of virus-cell fusion by ligand-displaying 
envelope fragments 



Envelope 


RBD fragment 


Titers 


A 


- 


8.1xl0 6 




A-RBD 


5.6xl0 5 




E-RBD 


8xl0 6 




34.1-A-RBD 


1.35xl0 6 




34.1-E-RBD 


3xl0 6 


AdelH 




2xl0 ! 




A-RBD 


8.5xl0 4 




E-RBD 


8xl0 5 




34.1-A-RBD 


7.3xl0 4 




34.1-E-RBD 


7xl0 5 



20 These data therefore suggest that it is possible to control the fusion activity of FMGs by 
adding specific ligands on the activating polypeptides. Therefore the design of a specific 
system to eliminate unwanted cells may consist in a delH FMG associated to an RBD 
polypeptide that displays a ligand that can specifically recognise a receptor expressed on the 
target cells. Table 4 shows that 34.1-A-RBD and 34.1-E-RBD could activate the cell-cell 

25 fusion activity of the PROMOdelHRless FMG with an efficiency similar to that of A-RBD 
or E-RBD. 

Altogether these results indicate that since the 34.1-E-RBD or 34.1-A-RBD 
polypeptides could cross-activate the fusogenicity of AdelH, AdelHRless, or 
30 PROMOdelHRless envelope glycoproteins on cells that express the targeted antigen (the 
MHC class I molecules) but no receptor corresponding to either the chimeric RBD or the 
delH retrovirus (Tables 3 & 4), efficient retargeted infection or cell-type specific syncytia 
formation could be obtained. 

35 
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Table 4. Modulation of cell-cell fusion by 
Iigand-displaying envelope fragments 



Envelope 


K±>L> 


Syncytia 


A 

A. 




18 


i\. XV J Coo 




4464 


f\ Xv 1 C O O Kl C 1 XI 




14 

IT 




RBD 


1014 


PROMORless 




14796 


PROMORlessdelH 




304 




A-RBD 


18920 




E-RBD 


17000 




34.1-A-RBD 


19000 




34.1-E-RBD 


20000 



Example 4: Activation of Env fusogenicity by RBD and C/TM domains 
20 interaction 

The H5del mutation was introduced into the CMO, the TMO and the CTMMO 
MLV amphotropic envelope glycoprotein (Lavillette et al. f in preparation), derived from the 
MLV 4070A envelope glycoprotein by substitution of the SU carboxy-teiroinal domain (C 

25 domain) and/or TM ectodomain with those derived from the MLV ecotropic envelope 
glycoprotein (Figure 29 A). Recent evidence from the literature suggest that the C domain of 
MLV envelope glycoproteins is involved in activation and/or control of the fusion 
machinery contained in the TM subunit of the envelope glycoprotein (Nussbaum et al., 
1993; Pinter et ah, 1997). Retroviruses carrying either the parental or the delH-mutated 

30 CMOdelH, TMOdelH and CTMMOdelH chimeric envelope glycoproteins were prepared 
and their infectivity was assayed in the presence or the absence of A-RBD or E-RBD 
soluble polypeptides (Table 5). While the infectivity of AdelH control retroviruses could 
easily be rescued when either A-RBD or E-RBD polypeptides were present in the infection 
mixture (Lavillette et al. s 2000), the infectivity of CMOdelH, TMMOdelH and 

35 CTMMOdelH retroviruses could not be rescued with the A-RBD polypeptide although it 
was efficiently rescued with the E-RBD polypeptide (Table 5). These results indicated that 
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the fusogenicity of the CMOdelH, TMMOdelH and CTMMOdelH envelope glycoproteins 
could only be activated by soluble RBD polypeptides derived from the same envelope 
glycoprotein from which the C and/or TM domain were inserted in the delH chimera. 
Further evidence for the requirement of a compatibility between the RBD and the C and/or 
5 TM domains were provided by the finding that the CTMMO retroviruses displayed a 
significantly slower kinetic of infection comparatively to that of retroviruses carrying wild- 
type MLV amphotropic envelope glycoproteins (Figure 29B). In the absence of exogenous 
RBD in the infection mixture, the replication retroviruses carrying the CTMMO envelope 
glycoproteins was delayed, most likely owing to the unability of the MLV 4070A-derived 

10 RBD of the latter to activate their MoMLV-derived C and/or TM domains. Yet the addition 
in the infection mixture of exogenous RBD polypeptides derived from the ecotropic 
MoMLV envelope glycoprotein (E-RBD), matching the C and/or TM domains of the 
CTMMO chimera could restore the infection kinetic of CTMMO retroviruses to a level 
similar if not identical to that of wild-type amphotropic retroviruses (Figure 29B). In 

15 contrast the kinetic of infection of retroviruses carrying wild-type amphotropic envelope 
glycoproteins was not increased in the presence of the E-RBD polypeptide. Additionally, 
infectivity with retroviruses carrying either wild-type amphotropic or CTMMO envelope 
glycoproteins was strongly decreased or inhibited, respectively, when infection was 
performed in the presence of A-RBD (Figure 29B). This inhibitory effect was most likely 

20 due to A-RBD-mediated partial blocking of amphotropic receptors in the case of wild-type 
amphotropic envelope glycoproteins and both receptor interference and incompatibility 
between this MLV amphotropic Env-derived fragment and the ecotropic C domain in the 
case of CTMMO chimera. 



25 



Table 5 

Infectivity in the presence of : 

30 
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no KJ5JJ 


A T>T>T\ 

A-KUD 


Jk-JKx>L> 


IU 


in 4 


IU 


in 1 


in 5 


IU 


IU 


in 2 

IU 


IU 


in 0 
10 


IU 


in 5 
10 


10 


ia3 

IU 


10 


in 0 


IU 


in 5 


10 6 


10 1 


10 6 


10° 


10° 


10 5 


10 6 


10 6 


10 6 



Example 5: Retargeted gene delivery using RBD fragments. 

We have therefore demonstrated that the RBD of a retroviral envelope glycoprotein 
5 is not only a domain responsible of attaching the viral particle to a cell surface receptor but 
is also a domain that is responsible, upon receptor binding, of fusion activation most likely 
via an interaction with the carboxy-terminal domain of the SU Env subunit (C domain). The 
RBD determinant responsible of this activation appears to be located at its amino-terminal 
end and can be disrupted by the delH mutation. Thus receptor binding of RBD most 

10 probably promotes a conformational change of this amino-terminal end which, in turn, is 
rendered competent for inducing further conformational changes in the Env C domain that 
control the fusion machinery located in the TM Env subunit. We surmised that the 
replacement of the RBD of a retroviral envelope glycoprotein by a cell type specific ligand 
may redirect the attachment of viral particles carrying such targeting envelope 

15 glycoproteins, yet specific attachment of these virions would not promote fusion activation 
due to the absence in the novel ligand of a molecular device responsible of fusion activation. 
Such an activation, leading to infection, may therefore be restored by providing in trans a 
soluble RBD in the infection mixture. 

Data in Table 6 demonstrate that retroviral vectors carrying a chimeric envelope 

20 glycoprotein in which the wild-type RBD is replaced by a single-chain antibody against 
human class I MHC molecules cannot infect human cells although they can efficiently bind 
to ciass I MHC molecules expressed at the surface of the target cells. The addition in trans 
of RBD polypeptides during infection can promote the events necessary to allow infection of 
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the target cells, demonstrating that retargeted binding of chimeric viral particles can give 
rise to productive infection and/or gene delivery when the fusion domains of the targeting 
envelope are activated by an exogenous polypeptide. 

5 

Table 6. RBD-mediated activation of the fusion activity of retargeted envelope 
glycoproteins 



Enva 




+ RBDb 


+ RBD & competitor 0 


A 


106 


105 


105 


34.1-A 


<101 


105 


102 



a: LacZ retroviral vectors were generated with the indicated envelope glycoproteins. A, 
15 wild-type amphotropic Env; 34.1-A, a chimeric envelope glycoprotein derived from the 
MLV amphotropic Env in which the receptor binding domain was replaced by a single 
chain antibody (scFv) against human class I MHC molecule (Marin et al, 1995). The 
vectors were used to infect class I positive human cells. Titers as lacZ i.u./ml. 
b: A preparation of A-RBD polypeptide was added during infection. Similar results were 
20 obtained when the A-RBD polypeptide was added before or after infection with the 34.1-A 
retroviruses. 

c: Anti-MHC-I IgGs, corresponding to the Mab which was used to derive the anti-MHC-I 
scFv inserted into the 34.1-A chimera, were used as competitor in this infection assay. The 
anti-MHC-I IgGs were used to block the targeted antigens before infection with the 34.1-A 
25 retroviruses. 
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CLAIMS 

1. Use of at least one activating polypeptide comprising an amino acid 
sequence corresponding to : 

- the amino acid sequence of the surface subunit (SU) of an envelope 
glycoprotein of a first virus, or of a first retrovirus, or 

- an amino acid sequence derived from said surface subunit, or 

- a fragment of an amino acid sequence of said surface subunit, encompassing 
the receptor binding domain (RBD) and the amino terminal part of the surface 
subunit, 

for the rescue of the fusion property of a fusion defective envelope glycoprotein, 
which has the property of attachment to target cells and is inactive with respect to the 
fusion with said target cells, said fusion defective envelope glycoprotein belonging to 
a second virus or a second retrovirus of the same type as or of a different type from 
the above designated "first virus or first retrovirus". 

2. Use of an activating polypeptide according to claim 1, wherein the 
activating polypeptide recognizes substantially the same receptors of the target cells as 
the fusion defective envelope glycoprotein or wherein the activating polypeptide 
recognizes receptors of the target cells which are different from the ones recognized 
by the fusion defective envelope glycoprotein. 

3. Use according to any one of the claims 1 or 2, wherein the activating 
polypeptide and/or the fusion defective envelope glycoprotein is (are) linked to a 
ligand, which is specific with respect to some receptors of the target cells, the ligand 
being preferably linked to the activating polypeptide. 

4. Use according to claim 3, wherein the ligand is chosen among molecules 
liable to recognize receptors of target cells, for instance receptors of tumoral cells, 
such as EGF, FGF, VEGF. 



WO 00/71578 



41 



PCT/EPOO/04534 



5. Use according to any one of claims 1 to 4, wherein the activating 
polypeptide is under the form of a product as such, or is expressed by a host, such as 
a virus or a cell, or is expressed, in particular by the target cells, transformed by 

5 nucleotide sequences coding for said activating polypeptide. 

6. Use according to any one of claims 1 to 5, wherein the fusion defective 
envelope glycoprotein is expressed by a host, such as a virus incorporating in its 
envelope said fusion defective envelope glycoprotein, or such as a cell, transformed 

10 by nucleotide sequences coding for said fusion defective envelope glycoprotein. 

7. Use according to any one of claims 1 to 6, wherein the activating 
polypeptide corresponds to a fragment of about 200 to about 250 amino acids of said 
surface subunit comprising the receptor binding domain (RBD) and comprising the 

15 first amino acids of the amino terminal part of the surface subunit, the number of said 

first aminoacids varying from about 10 to about 15, in particular comprising the RBD 
and the PHQ peptide, in particular the fragments corresponding to amino acid 
sequences derived from type C mammalian retroviruses such as: 
- Murine leukemia virus such as : 
20 . amphotropic 

. ecotropic 
. xenotropic 
. polytropic 
.10A1MLV 

25 - or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 

FELV-B, FELV-C 

in particular wherein the activating polypeptide corresponds to one of the 
following amino acid sequences : 
A-RBD 

30 E-RBD 

GALV-RBD 
34-1-A-RBD 
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34-1-E-RBD 

34-1-GALV-RBD 

EGF-A-RBD 

EGF-E-RBD 

EGF-GALV-RBD 

8. Use according to any one of claims 1 to 7, wherein the fusion defective 
envelope glycoprotein corresponds to the amino acid sequence of a mutated form of an 
envelope glycoprotein of a virus or a retrovirus which, in its non mutated form, 
presents attachment and fusion properties with respect to target cells, said fusion 
defective envelope glycoprotein being advantagously derived from type C mammalian 
retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 
FELV-B, FELV-C and comprising : 

* one or several mutations in the amino terminal part of its surface subunit, 
particulary in the first amino acids of the amino terminal part of the surface subunit, 
the number of said first aminoacids varying from about 10 to about 15, said mutations 
inactivating the fusion property without substantially affecting the attachment 
property, in particular comprises one or several mutations within the PHQ amino- 
terminal sequence, wherein in particular the fusion defective envelope glycoprotein 
corresponds to one of the following amino acid sequences : 

- MO R less del H 

- MO del H 

- A R less del H 

- AdelH 
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- PROMO R less del H 

- PROMOdelH 

- GALV R less del H 

- GALVdelH 

- C2 R less del H 

- C2delH 

- C MOdelH 

- TM MOdelH 

- C TM MOdelh 



10 



* or the replacement of the ammo-terminal domain of its surface subunit (RBD) 
by a ligand such as a growth factor, or a cytokine or a single chain antibody, the 
extent of said mutations inactivating the fusion property and the attachement property 
to the normal viral receptor, in particular wherein the fusion defective envelope 

15 glycoprotein corresponds to the following amino-acid sequence : 

-34.1-A 

* or the replacement of its SU carboxy-terminal domain and / or its TM 
ectodomain, and preferably its SU carboxy-terminal domain by the corresponding SU 

20 carboxy-terminal domain and / or the TM ectodomain of another "second" 

glycoprotein derived from type C mammalian retroviruses, with the proviso that said 
fusion defective glycoprotein and said second glycoprotein are not derived from the 
same type C mammalian retroviruses, in particular wherein the fusion defective 
envelope glycoprotein corresponds to one of the following amino-acid sequences : 
25 - CMO 

-TM MO 
- C TM MO 

9. Use of at least one activating polypeptide and of at least one fusion defective 
30 envelope glycoprotein for the preparation of a drug useful for the treatment of cancer 

and infectious diseases. 



BNSDOC1D: <WO 0071578A2_I_> 



WO 00/71578 PCT/EP00/04534 

44 

10. Products containing an activating polypeptide and a fusion defective 
envelope glycoprotein as defined in any one of claims 1 to 9, as a combined 
preparation for simultaneous, separate, or sequential use for the rescue of the fusion 
defective property of said fusion defective envelope glycoprotein. 
5 11. Products containing an activating polypeptide and a fusion defective 

envelope glycoprotein as defined in any one of claims 1 to 9, as a combined 
preparation for simultaneous, separate or sequential use for the treatment of cancer 
pathologies. 

10 12. Activating polypeptides corresponding to a fragment of about 200 to about 

250 amino acids of said surface subunit comprising the RBD and the PHQ peptide, in 
particular the fragments corresponding to amino acid sequences derived from type C 
mammalian retroviruses such as: 

- Murine leukemia virus such as : 
15 . amphotropic 

. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

20 - or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 

FELV-B, FELV-C 

in particular wherein the activating polypeptide corresponds to one of the 
following amino acid sequences : 
A-RBD 

25 E-RBD 

GALV-RBD 

34-1-A-RBD 

34-1-E-RBD 

34-1-GALV-RBD 
30 EGF-A-RBD 

EGF-E-RBD 

EGF-GALV-RBD 
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13. Fusion defective envelope glycoprotein to the amino sequence of an 
envelope glycoprotein of a virus or a retrovirus which in its native state presents 
attachment and fusion properties with respect to target cells, and comprises one or 
several mutations in the amino terminal part of its surface subunit, inactivating the 
fusion property without substantially affecting the attachment property, in particular 
comprises one or several mutations within the PHQ amino sequence of the amino 
terminal part of its surface subunit, wherein in particular the fusion defective envelope 
glycoprotein is derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 
FELV-B or FELV-C 

in particular wherein the fusion defective envelope glycoprotein corresponds to 
one of the following amino acid sequences : 

- AdelH 

- PROMOdelH 

- GALVdelH 

- C2delH 

- A R less del H 

- PROMO R less del H 

- GALV R less del H 

- C2 R less del H 

- C MO 

- TM MO 

- C TM MO 

- C MOdelH 

- TM MOdelH 



WO 00/71578 PCT/EPOO/04534 

46 

- C TM MOdelh 

- 34.1-A 

with the proviso that they are different from the ecotropic MLV sequence, wherein the 
amino acids in position 2 to 8 are either substituted or deleted. 
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A-RBD Sequence 



10 20 30 40 ~ so TiT 

123 456 789 012 345 678 901 234 567 890 123 456 789 m 9 335 678 9m 234 567 890 
ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA " 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys He Asn Pro 5 ^ S Su 



60 



ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAT CAG GTC TTT AAT 
He Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro His Gin Val Phe Asn 

GTA ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG 
Val Thr Trp Arg Val Thr Asn Leu Met Thr ply Arg Thr Ala Asn Ala Thr Ser Leu Leu 

GGA ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTG rn rnc 
Gly Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu £ £ Su vS £ Su 

GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG 
Glu Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly 

AGA CAG OGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TCG GGG 
Arg Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly 

TCTGGGGGACCAGGAGAGGGCTACTGTGGTAAATGG GGG' TGT GAA ACC ACC GGA CAG GCT 
Cys Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu £ S ™ Sn Sa 

AAG OCC ACA TCA TOG TGG GAC CTA ATC TCC CTT AAG OGC GGT AAC ACC CCC TGG 
Tyr Tip Lys Pro Thr Ser Ser Trp Asp Leu ^ e ^ e3 - Leu Lys Arg Gly Asn Thr Pro Trp 

GAC ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC TCC 211 era rrrv- 
Asp Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Sex Lys val Ser Asn 

TCC TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC rv-r nn* ^ ~ 

Al^ AAA AAG GCT AAC TGG GAC OGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG ACA 
Ala Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg £ 

Ph^" ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG 

Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr G^n 'Gin Ty^ His Gin 

CTA AAA CCA CTA GAA TAC GAG CCA TGA 

Leu Lys Pro Leu Glu Tyr Glu Pro ... 867 
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E-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 m -7 89 012 3d c; C78 901 23< ^ 8go 

ATG GCG CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GCT TCG CCC GSC TCC ACT CCT 120 
lie pro Leu He Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Pro 

CAT CAA GTC TAT AAT ATC ACC TGG GAG GTA ACC AAT GGA GAT COG GAG ACQ GTA TGG GCA 180 
His Gin Val Tyr Asn lie Thr Trp Glu Val' Thr Asn Gly Asp Arg Glu Thr Val Trp Ala 

ACT TCT GGC AAC CAC CCT CTG TGG ACC TGG TGG CCT GAC CTT ACC CCA GAT TTA TGT ATG 240 
Thr Ser Gly Asn His Pro Leu Trp Thr Trp Trp Pro Asp Leu Thr Pro Asp Leu Cys Met 

TTA GCC CAC CAT. GGA CCA TCT TAT TGG GGG CTA GAA TAT CAA TCC CCT TTT TCT -TCT CCC 300 
Leu Ala His Has Gly Pro Ser Tyr Trp Gly Leu Glu Tyr Gin Ser Pro Phe Ser Ser Pro 

dt-o r*T^ ^CK GGG GGC AGC AGC CCA GGC TCT TCC AGA GAC TGC GAA GAA 360 

Pro Gly Pro Pro Cys Cys Ser Gly Gly Ser Ser Pro Gly Cys Ser Arg Asp Cys Glu Glu - 

OCT TTA ACC TCC CTC ACC OCT OGG TGC AAC ACT GCC TOG AAC ASA CTC AAG CTA GAC CAG 420 
Pro Leu Thr Ser Leu Thr Pro Arg Cys Asn Thr Ala Trp Ash Arg Leu Lys Leu Asp Gin 

AAA TCA AAT GAG GGA TTT TAT GTT TGC CCC GGG CCC CAC CGC CCC CGA GAA 480 
Thr Thr Hxs Lys Ser Asn Glu Gly Phe Tyr Val Cys Pro Gly Pro His Arg Pro Arg Glu 

TCC AAG TOT GGG GGT CCA GAC TCC TTC TAC TGT GCC TAT TOG GGC TGT GAG ACA ACC 540 

Ser Lys Ser Cys Gly Gly Pro Asp Ser Phe Tyr Cys Ala Tyr Trp Gly Cys Glu Thr Thr 

•nT'* AAG CCC TCC TCA TCA TGG GAT TTC ATC ACA GTA AAC AAC AAT CTC 600 

Gly Arg Ala Tyr Trp Lys Pro Ser Ser Ser Trp Asp Phe He Thr Val Asn Asn Asn Leu 

ACC TCT GAC CAG GCT GTC CAG GTA TGC AAA GAT AAT AAG TGG TGC AAC CCC TTA GTT ATT 660 
Tnr Ser Asp Gin Ala Val Gin Val Cys Lys Asp Asn Lys Trp Cys Asn Pro Leu V&1 iZ 

wu^ GAC GGG AGA CGG GTT ACT TCC TOG ACC' ACA. GGA CAT TAC TGG GGC TTA 720 

Arg Phe Thr Asp Ala Gly Arg Arg Val Thr Ser Trp Thr Thr Gly His Tyr Trp Gly Leu 

CGT TTG TAT GTC TCC GGA CAA GAT CCA GGG CTT ACA TTT GGG ATC AGA CTC AGA TAC CAA 780 
Arg Leu Tyr Val Ser Gly Gin Asp Pro Gly Leu Thr Phe Gly lie S Su £ Tyr 

AAT CTA GGA CCC CGC GTC CCA ATA GGG ATC CAA TAC CCC TAC GAC GTC CCC GAC TAT GCC 840 
Asn Leu Gly Pro Arg Val Pro He Gly He Gin Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 

AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT 900 
Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr 

CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA TGA o 42 
Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 



FIG. 7 
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GALV-RBD- Sequence 



10 20 30 40 5 o eo 

123 456 789 012 345 678 901 234 5 67 890 123 456 789 013 ?45 678 901 56 7 890 

ATC GTA TTG CTG CCT GGG TCC ATG CTT CTC ACC TCA AAC CTG CAC CAC CTT CGG CAC CAG 
Met Val Leu Leu Pro Gly Ser Met Leu Leu Thr Ser Asn Leu His His Leu Arg His Gin 

ATG AGT CCT GGG AGC TGG AAA AGA CTG ATC ATC CTC TTA AGC TGC GTA TTC GGC GGC GGC 
Met Ser Pro Gly Ser Trp Lys Arg Leu He lie Leu Leu Ser Cys Val Phe Gly Gly Gly 

GGG ACG AGT CTG CAA AAT AAG AAC CCC CAC CAG CCC ATG ACC CTC ACT TGCS' CAG GTA CTG 
Gly Thr Ser Leu Gin Asn Lys Asn Pro His Gin Pro Met Thr Leu Thr Trp Gin Val Leu 

TCC CAA ACT GGA GAC GTT GTC TGG GAT ACA AAG GCA GTC CAG CCC CCT TGG ACT TGG TGG 
Ser Gin Thr Gly Asp Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro Trp Thr Trp Trp 

CCC ACA CTT AAA CCT GAT GTA TGT GCC TTG GCG GCT AGT CTT GAG TCC TGG GAT ATC COG 
Pro Thr Leu Lys Pro Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser Trp Asp He Pro 

GGA ACC GAT GTC TCG TCC TCT AAA CGA GTC AGA CCT CCG GAC TCA GAC TAT ACT GCC GCT 
Gly Thr Asp val Ser Ser Ser Lys Arg Val Arg Pro Pro Asp Ser Sp £ t2 Sa SI 

TAT AAG CAA ATC ACC TGG GGA GCC ATA GGG TGC AGC TAG CCT CGG GCT AGG ACT AGA ATG 
Tyr Lys Gin He Thr Trp Gly Ala He Gly Cys Ser Tyr Pro Arg Ala Arg Thr Arg Met 

GCA AGC TCT ACC TTC TAC GTA TGT CCC CGG GAT GGC CGG ACC CTT TCA GAA GCT AGA AGG 
Ala Ser Ser Thr Phe Tyr Val Cys Pro Arg Asp Gly Arg Thr Leu Ser Glu Ala Arg Arg 

TGC GGG GGG CTA GAA TCC CTA TAC TCT AAA GAA TGG GAT 1GT GAG ACC ACG GGG ACC GGT 
Cys Gly Gly Leu Glu Ser Leu Tyr Cys Lys Glu Trp Asp Cys Glu Thr Thr Gly Thr Gly 

TAT TGG CTA TCT AAA TCC TCA AAA GAC CTC ATA ACT GTA AAA TGG GAC CAA AAT AGC GAA 
Tyr Trp Leu Ser Lys Ser Ser Lys Asp Leu He Thr Val Lys Trp Asp Gin Asn Ser Glu 

TGG ACT CAA AAA TTT CAA CAG TGT CAC CAG ACC GGC TGG TGT AAC CCC CTT AAA ATA GAT 
Trp Thr Gin Lys Phe Gin Gin Cys His Gin Thr Gly Trp Cys Asn Pro Leu Lys He Asp 

TTC ACA GAC AAA GGA AAA TTA TCC AAG GAC TGG ATA ACG GGA AAA ACC TGG GGA TTA AGA 
Phe Thr Asp Lys Gly Lys Leu Ser Lys Asp Trp He Thr Gly Lys Thr Trp Gly Leu Arg 

TTC TAT GTC TCT GGA CAT CCA GGC GTA CAG TTC ACC ATT CGC TTA AAA ATC ACC AAC ATC 
Phe Tyr Val Ser Gly His Pro Gly Val Gin Phe Thr He Arg Leu Lys lie Thr Asn Met 

CCA GCT GTC GCA GTA GGG ATC CAA TAC CCC TAC GAC GTC CCC GAC TAT GCC AAT CGA TTA 
Pro Ala Val Ala Val Gly lie Gin Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Asn Arg Leu 

GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTC ACT CAA CAA TAC 
Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr 
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EGF-A-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GCG CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GCT TCG CCC GGC TCC AGT GCG 120 
lie Pro Leu lie Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAG CCG GCC ATG GCC AAT AGT GAC TOT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC 180 
Ala Gin Pro Ala Met Ala Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys 

CTC CAT GAT GGT GTG TGC ATG TAT AIT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT 240 
Leu His Asp Gly Val Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val 

GTT GGC TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC GCG 300 
Val Gly Tyr He Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Ala. 

GCC GCA ATC GAG GGA AGG ATG GCA GAG AGC CCC CAT CAG GTC TTT AAT GTA ACC TGG AGA 360 
ALa Ala lie Glu Gly Arg Met Ala Glu Ser Pro His Gin Val Phe Asn Val Thr Trp Arg 

GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA ACT GTA CAA 420 
Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly Thr Val Gin 

GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG TGG GAC CCT 480 
Asp Ala Phe Pro Lys Leu iyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu Trp Asp Pro 

TCA GAC GAG GAA CCG TAT GTC GGG TAT GGC TGC. AAG TAC CCC GCA GGG AGA CAG CGG ACC 540 
Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg Gin Arg Thr 

GGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TOG GGG TGT GGG GGA CCA 600 
Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys Gly Gly Pro 

GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAC TGG AAG CCC 660 
Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr Trp Lys Pro 

ACA TCA TCG TGG GAC CTA ATC TOC CTT AAG CGC GGT AAC ACC CCC TGG GAC AGG GGA TGC 720 
Thr Ser Ser Trp Asp Leu lie Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp Thr Gly Cys 

TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAT TCC TTC CTA GGG 78*0 
Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser Phe Gin Gly 

GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCA GGA AAA AAG 840 
Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala Gly Lys Lys 

GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC COG ACA GGA ACA GAT CCT 900 
Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly Thr Asp Pro 

ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTC CCC ATA GGG 960 
He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val Pro He Gly 

ATC CAA TAC CCC TAC GAC GTC CCC GAC TAT GCC AAT CGA TTA GTT CAA TTT GTT AAA GAC 1020 
He Gin Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Asn Arg Leu Val Gin Phe Val Lys Asp 

AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA 1080 
Arg He Ser Vai Val Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro Leu 

GAA TAC GAG CCA TGA Q - 

Glu Tyr Glu Pro . . . 

FIG. 9 
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EGF-E-RBD- Sequence 



10 20 ^ 30 



60 
120 
180 
240 
300 



123 456 78 9 012 345 fi7» om 567 B Z „_ ... 50 60 

*e oca cor tca a cg ctc tca aaa ccc cS^a Z ^ Z Al ^ *" ™ ™ 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GCT TCG err r^ - 
Xle Pro Leu He Leu Leu Me t Leu tog Gly Val £ £ S s'S £ Gly S ^ S 

GCC CAG CCG GCC ATG GCC AAT ACT GAT tyt ™* fTV ^ n 

CTC CAT GAT GGT GTG TGC ATG TAT ATT raa /-v-* 

GTT GGC TAC ATC GGG GAG CGA TGT CACZ T*r* rv-.* ™ . 

GCC GCA ATC GAG QGA AGG GCT Trv rw ~rv-, ™ 

TGG GAG GTA ACC AAT GGA GAT CGG GAG ACG GTA TCf: nr-a »~n ~ 
TGG ACC TGG TOG CCT GAC CTT APT rri ™*n r „ 

^ Ser ^ Asp Cys Glu Glu Pro Leu Thr Ser Le U Thr Pro 

5=5=5=5==5=s;s =s== -- = 
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EGF-GALV-RBD- . Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GOG CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GOT TCG CCC GGC TCC AGT GCG 120 
He Pro Leu He Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser .Pro Gly Ser Ser Ala 

.GCC CAG CCG GCC ATG GCC AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAtf-GQG TAC TGC 180 
Ala Gin Pro Ala Met Ala Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys 

7 GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT 240 



300 



CTC CAT 


GAT 


GGT 


GTG 


TGC 


ATG TAT 


Leu His 


Asp 


Gly 


Val 


Cys 


Met Tyr 


GTT GGC 


TAC 


ATC 


GGG 


GAG 


CGA TGT 


Val Gly 


Tyr 


lie 


Gly 


Glu 


Arg Cys 


GCC GCA 


ATC 


GAG 


GGA 


AGG 


AGT CTG 


Ala Ala 


He 


Glu 


Gly 


Arg 


Ser Leu 



360 



TOG CAG 
Tip Gin 


GTA 
Val 


CTG 
Leu 


TCC 
Ser 


CAA 
Gin 


ACT GGA 
Thr Gly 


GAC 
Asp 


GTT GTC TGG GAT ACA AAG GCA GTC CAG CCC CCT 
Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro 


420 


TGG ACT 
Trp Thr 


TGG 
Trp 


TGG 
Trp 


CCC 
Pro 


ACA 
Thr 


CTT AAA 
X^eu Lys 


CCT 
Pro 


GAT GTA TGT GCC TTG GCG GOT AGT CTT GAG TCC 
Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser 


480 


TGG 
Txp 


GAT 
Asp 


ATC 
Jle 


CCG 
Pro 


GGA 
Gly 


ACC 
Thr 


GAT GTC 
Asp Val 


TCG 
Ser 


TCC TCT AAA CGA GTC AGA CCT CCG GAC TCA GAC 
Ser Ser Lys Arg Val Arg Pro Pro Asp Ser Asp 


540 


TAT 
Tyr 


ACT 
Thr 


GCC 
Ala 


GCT 
Ala 


TAT 

Tyr 


AAG 
Lys 


CAA ATC 
Gin He 


ACC 
Thr 


TGG GGA GCC ATA GGG TGC AGC TAC CCT CGG GCT 
Trp Gly Ala He Gly Cys Ser Tyr Pro Arg Ala 


600 


AGG 
Arg 


ACT 
Thr 


AGA 
Arg 


ATG 
Met 


GCA 
Ala 


AGC 
Ser 


TCT ACC 
Ser Thr 


TTC 
Phe 


TAC GTA TGT CCC CGG GAT GGC CGG ACC CTT TCA 
Tyr Val Cys Pro Arg Asp Gly Arg Thr Lieu Ser 


660 


GAA 
Glu 


GOT 
Ala 


AGA 
Arg 


AGG 
Arg 


TGC 
Cys 


GGG 
Gly 


GGG CTA 
Gly "Leu 


GAA 
Glu 


TCC CTA TAC TGT AAA GAA TGG GAT TGT GAG ACC 
Ser Leu Tyr Cys Lys Glu Trp Asp Cys Glu Thr 


720 


ACG 
Thr 


GGG 
Gly 


ACC 
Thr 


GGT 
Gly 


TAT 
Tyr 


TGG 
Trp 


CTA TCT 
Leu Ser 


AAA 

Lys 


TCC TCA AAA GAC CTC ATA ACT GTA AAA TGG GAC 
Ser Ser Lys Asp Leu He Thr Val Lys Trp Asp 


780 


CAA 
Gin 


AAT 
Asn 


AGC 
Ser 


GAA 
Glu 


TGG 
Trp 


ACT 
Thr 


CAA AAA 
Gin Lys 


TTT 
Phe 


CAA CAG TGT CAC CAG ACC GGC TGG TGT AAC CCC 
Gin Gin Cys His Gin Thr Gly Trp Cys Asn Pro 


840 


CTT 
Lea 


AAA 
Lys 


ATA 
He 


GAT 
Asp 


TTC 
Phe 


ACA 
Thr 


GAC AAA 
Asp Lys 


GGA 
Gly 


AAA TTA TCC AAG GAC TGG ATA ACG GGA AAA ACC 
Lys Leu Ser Lys Asp Trp He Thr Gly Lys Thr 


900 


TGG 
Trp 


GGA 
Gly 


TTA 
Leu 


AGA 

Arg 


TTC 
Phe 


TAT 
Tyr 


GTG TCT 
Val Ser 


GGA 
Gly 


CAT CCA GGC GTA CAG TTC ACC ATT CGC TTA AAA 
His Pro Gly Val Gin Phe Thr He Arg Leu Lys 


960 


ATC 
He 


ACC 
Thr 


AAC 
Asn 


ATG 
Met 


CCA 
Pro 


GCT 
Ala 


GTG GCA 
Val Ala 


GTA 
Val 


GGG ATC CAA TAC CCC TAC GAC GTC CCC GAC TAT 
Gly He Gin Tyr Pro Tyr Asp Val Pro Asp Tyr 


1020 


GCC 
Ala 


AAT 
Asn 


CGA 
Arg 


TTA 
Leu 


GTT 
Val 


CAA 
Gin 


TTT GTT 
Phe Val 


AAA 
Lys 


GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG 
Asp Arg He Ser Val Val Gin Ala Leu Val Leu 


1080 


ACT 
Thr 


CAA 
Gin 


CAA 
Gin 


TAC 
Tyr 


CAC 
His 


CAG 
Gin 


CTA AAA 
Leu Lys 


CCA 
Pro 


CTA GAA TAC GAG CCA TCA 
Leu Glu Tyr Glu Pro 


1125 
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34.1-A-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 56 7 890 123 A*fj 789 012 345 678 901 2^ ^-7 son 

ATG GCG CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GCT TCG CCC GGC TCC AGT GCG 120 
He Pro Leu He Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAG CCG GGC ATG GCC CAG GTT CAG CTT CAG CAG TCP GGA GCT GAA CTG GTA AAG CCT 180 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro 

GGG GCT TCA GTG AAG TTG TCC TGC AAG GCT TCT GGC TAC ACC TTC ACA AGC TAT GAT ATA 240 
Gly Ala Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Asp He 



300 



360 



AAC TGG GTG AGG CAG AGG CCT GAA CAG GGA CTT GAG TGG ATT GGA TGG ATT TTT CCT GGA 
Asn Trp Val Arg Gin Arg Pro Glu Gin Gly Leu Glu Trp He Gly Tip lie Phe Pro Gly 

GAT GGT AGT ACT AAG TAC AAT GAG AAG TTC AAG GGC AAG GCC ACA CTG ACT ACA GAC AAA 
Asp Gly Ser Thr Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala Thr Leu Thr Thr Asp Lys 

TCC TCC AGC ACA GCC TAC ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC TAT 420 
Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala He Tyr 

TTC TGT GCA AGA ATG GCC GAC TAT GGT CAC TAC GGC GCT GTG GAC TAC TGG GGT CAA GGG 480 
Phe Cys Ala Arg Met Ala Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly 

'^ CC "^"^ GGT GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC 540 
Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

Gly Ser Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly Asp Arg 

^mf~ ti^ TOC AGG GCA AGT CAG GAC ATT AGC AAT TAT TTA AAC TGG TAT CAG CAG 660 

Val Thr He Ser Cys Arg Ala Ser Gin Asp He Ser Asn Tyr Leu Asn Trp Tyr Gin Gin 

ACA CCA GAT GGA ACT GTT AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA GTC 720 
Thr Pro Asp Gly Thr Val Lys Leu Leu He Tyr Tyr Thr Ser Arg Leu His Ser Sy 55 

CCA TCA AGG TTC AGT GGC AGT GGG TCT GGA ACA GAT TAT TCT CTC ACC ATT AGC AAC CTG 780 
Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr He Ser Asn Leu 

GAG CAA GAA GAT ATT GCC ACT TAC TTT TGC CAA CAG GGT TAT ACG CTT CCG TGG ACG TTC 840 
Glu Gin Glu Asp lie Ala Thr Tyr Phe Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Phe 

GGT GGA GGC ACC AAG CTG GAG ATC AAA CGG . GCG GCC GCA ATC GAG GGA AGG ATG GCA GAG 
Gly Gly Gly Thr Lys Leu Glu He Lys Arg Ala Ala Ala He Glu Gly Arg Met Ala Glu 

AGC CCC CAT CAG GTC TTT AAT GTA ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC 
Ser Pro His Gin Val Phe Asn Val Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr 

GCC AAT GCC ACC TCC CTC CTG GGA ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT 
Ala Asn Ala Thr Ser Leu Leu Gly Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp 

CTA TGT GAT CTG GTC GGA GAG GAG TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT 
Leu Cys Asp Leu Val Gly Glu Glu Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr 



9.00 
960 
1020 
1080 



FIG. 12 



WO 00/71578 



PCT/EPOO/04534 



13/49 

34.1-A-RBD-. Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

GGC TGC AAG TAC CCC GCA GGG AGA CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT 1140 
Gly Cys I*ys Tyr Pro Ala Gly Arg Gin Arg Thr Arg Thr Phe Asp Phe Tygr Val Cys Pro 

GGG CAT ACC GTA AAG TCG GGG TGT GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG 1200 
Gly His Thr Val Lys Ser Gly Cys Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly 

TGT GAA ACC ACC GGA CAG GOT TAC TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT 1260 
Cys Glu Thr Thr Gly Gin Ala Tyr Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu 

AAG CGC GGT AAC ACC CCC TGG GAC ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC 1320 
Lys Arg Gly Asn Thr Pro Trp Asp Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr 

GAC CTC TCC AAA GTA TCC AAT TCC TTC CAA GGG GCT ACT OGA GGG GGC AGA TGC AAC CCT 1380 
Asp Leu Ser Lys Val Ser Asn Ser Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro 

CTA GTC CTA GAA TTC ACT GAT GCA GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG 1440 
X*eu Val Leu Glu Phe Thr Asp Ala Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp 

GGA CTG AGA CTG TAC CGG ACA GGA ACA GAT OCT ATT ACC ATG TTC TCC CTG ACC CGG CAG 1500 
Gly Leu Arg Leu Tyr Arg Inr Gly Thr Asp Pro lie Thr Met Phe Ser Leu Thr Arg Gin 

GTC CTT AAT GTG GGA CCC GGA GTC CCC ATA GGG ATC CAA TAC CCC TAC GAC GTC CCC GAC 1560 
Val Leu Asn Val Gly Pro Arg Val Pro He Gly He Gin Tyr Pro Tyr Asp Val Pro Asp 

TAT GCC AAT OGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC 1620 
Tyr Ala Asn Arg Leu Val. Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val 

CTG ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA TGA 1668 
Leu Thr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro . . . 
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34.1-E-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 5 67 890 123 456 789 012 345 678 901 234 567 flqn 

ATG GCG CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GCT TCG CCC GCfe TCC AGT GCG 120 
He Pro Leu He Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAG CCG GCC ATG GCC CAG GTT CAG CTT CAG CAG TCT GGA GCT GAA CTG GTA AAG CCT 180 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro 

GGG GCT TCA GTG AAG TTG TCC TGC AAG GCT TCT G3C TAC ACC TTC ACA AGC TAT GAT ATA 240 
Gly Ala Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Asp He 

AAC TGG GTG AGG CAG AGG CCT GAA CAG GGA CTT GAG TGG ATT GGA TGG ATT TTT CCT GGA 
Asn Trp Val Arg Gin Arg Pro Glu Gin Gly Leu Glu Trp He Gly Trp He Phe Pro Gly 



300 
360 



GAT GGT AGT ACT AAG TAC AAT GAG AAG TTC AAG GGC AAG GCC ACA CTG ACT ACA GAC AAA. 
Asp Gly Ser Thr Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala Thr Leu Thr Thr Asp Lys 

TCC TCC AGC ACA GCC TAC ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC TAT 420 
Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala He Tyr 

TTC TGT GCA AGA ATG GCC GAC TAT GGT CAC TAC GGC GCT GTG GAC TAC TGG GGT CAA GGG 480 
Phe Cys Ala Arg Met Ala Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly 

ntw* fj?^ GRC TCC TCA GGT GSA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC 540 

Thr Thr val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

GGA TCG GAC ATC CAG ATG ACT CAG TCT CCA TCC TCC CTG TCT GCC TCT CTG GGA GAC AGA 600 
Gly Ser Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly Asp Arg 

m?" AGG GCA AGT CAG GAC ATT AGC AAT TAT TTA AAC TGG TAT CAG CAG 660 

Val Thr He Ser Cys Arg Ala Ser Gin Asp He Ser Asn Tyr Leu Asn Trp Tyr Gin Gin 

ACA CCA GAT GGA ACT GTT AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA GTC 720 
Thr Pro Asp Gly Thr Val Lys Leu Leu He Tyr Tyr Thr Ser Arg Leu His Ser Gly Val 

CCA TCA AGG TTC AGT GGC AGT GGG TCT GGA ACA GAT TAT TCT CTC ACC ATT AGC AAC CTG 780 
Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr He Ser Asn Leu 

GAA GAT ATT GCC ACT TAC TTT TGC CAA CAG GGT TAT ACG CTT CCG TGG ACG TTC 840 
Glu Gin Glu Asp He Ala Thr Tyr Phe Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Phe 

GGC ACC AAG CTG GAG ATC AAA CGG GCG GCC GCA ATC GAG GGA AGG GCT TCG CCC 900 
Gly Gly Gly Thr Lys Leu Glu He Lys Arg Ala Ala Ala He Glu Gly Arg Ala Ser Pro 

GGC TCC AGT CCT CAT CAA GTC TAT AAT ATC ACC TGG GAG GTA ACC AAT GGA GAT CGG GAG 960 
Gly Ser Ser Pro His Gin Val Tyr Asn He Thr Trp Glu Val Thr Asn Gly Asp Arg Glu 

ACG GTA TGG GCA ACT TCT GGC AAC CAC CCT CTG TGG ACC TGG TGG CCT GAC CTT ACC CCA 1020 
Thr Val Trp Ala Thr Ser Gly Asn His Pro Leu Trp Thr Trp Trp Pro Asp Leu Thr Pro 

GAT TTA TGT ATG TTA GCC CAC CAT GGA CCA TCT TAT TGG GGG CTA GAA TAT CAA TCC CCT 1080 
Asp Leu Cys Met Leu Ala His His Gly Pro Ser Tyr Trp Gly Leu Glu Tyr Gin Ser Pro 
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34.1-E-RBD- Sequence 



10 20 ' 30 40 : i5 To 

123 456 789 012 345 678 901 234 567 890 1 23 456 7flQ pig 345 678 Qm 23 4 567 89 o 

TTT TCT TCT CCC COG GGG CCC CCT TGT TGC TCA GGG GGC AGC AGC CCA GGC TGT TCC AGA 114 0 " 

Phe Ser Ser Pro Pro Gly Pro Pro Cys Cys Ser Gly Gly Ser Ser Pro Gly Cys Ser Arg 



1200 
1260 
1320 



GA.C TGC GAA GAA CCT TTA ACC TCC CTC ACC CCT CGG TGC AAC ACT GCC TGG AAC AGA CTC 
Asp Cys Glu Glu Pro Leu Thr Ser Leu Thr Pro Arg Cys Asn Thr Ala Trp Asn Arg Leu 

ARG CTA GAC CAG ACA ACT CAT AAA TCA AAT GAG GGA TTT TAT GOT TGC CCC QQG CCC CAC 
Lys Leu Asp Gin Thr Thr His Lys Ser Asn Glu Gly Phe Tyr Val Cys Pro Gly Pro His 

OX:CCCCGAGAATCCAAGTCATGTGGGGGTCC*GAC 

Arg Pro Arg Glu Ser Lys Ser Cys Gly Gly Pro Asp Ser Phe Tyr Cys Ala Tyr Trp Gly 

TOT GAG ACA ACC GGT AGA GCT TAG TGG AAG CCC TCC TCA TCA TGG GAT TTC ATC ACA GTA 1380 
cys Glu Thr Thr Gly Arg Ala Tyr Trp Lys Pro Ser Ser Ser Trp Asp S S tS S 

AAC AAC AAT CTC ACC TCT GAC CAG GCT GTC CAG GTA TGC AAA GAT AAT AAG TGG TGC A»f 
Asa Asn Asn Leu Thr Ser Asp Gin Ala Val Gin Val Cys Lys Asp .Asn Lys Trp Cys Asn 

.CCC TTA GTT ATT CGG .TTT ACA GAC GCC GGG AGA CGG GTT ACT TCC TW irr » m /-.nm 
Leu ^' a ^ Arg Phe' Thr Asp Ala Gly Arg Arg Val Thr Ser Trp. Thr Thr Gly His 

33C TGG GGC TTA CGT TTG TAT GTC TCC GGA CAA GAT CCA GGG CTT ACA TTT GGG ATC Af-a 
Tyr Trp Gly Leu Arg Leu Tyr val Ser Gly Gin Asp Pro Gly S tS p£ S S Sg" 

C1C AGA TAC CAA AAT CTA GGA CCC CGC GTC CCA ATA GGG ATC CAA TAC CCC TAC GAC Clf 
L.ex Arg Tyr Gin Asn Leu Gly Pro Arg Val Pro He Gly He Gin Tyr Pro Tyr Asp Val 

CCC GAC TAT GCC, AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC far 1 ry*r* 
Pro Asp Tyr Ala Asn Arg Leu Val Gin Phe Val Lys Asp Arg lie Ser Val Val Gin Ala 

GTC CTC ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA TGA 
Lex val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr S2 Pro" !^ 
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34.1-GALV-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GGG CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CIG ATG CTC AGA GGG GTC AGT ACT GCT TCG CCC GG&.TCC AGT GOG 120 
lie Pro Leu lie Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAG CCG GCC ATG GCC CAG GTT CAG CTT CAG GAG TCT GGA GCT GAA CTG GTA AAG CCT 180 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro 

GOG GCT TCA GTG AAG TIG TCC TGC AAG GCT TCT GGC TAC ACC TTC ACA AGC TAT GAT ATA 240 
Gly Ala Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Asp lie 

AAC TGG GTG AGG CAG AGG CCT GAA CAG GGA CTT GAG TQG ATT GGA TGG ATT TTT CCT GGA 300 
Asn Trp Val Arg Gin Arg Pro Glu Gin Gly Leu. Glu Trp He Gly Trp lie Phe Pro Gly 

GAT GGT AGT ACT AAG TAC AAT GAG AAG TTC AAG GGC AAG GCC ACA CTG ACT ACA GAC AAA 360 
Asp Gly Ser Thr Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala Thr Leu Thr Thr Asp Lys 

TCC TCC AGC ACA GCC TAC ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC TAT 420 
Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala He Tyr 

TTC TGT GGA AGA ATG GCC GAC TAT GGT CAC TAC GGC GCT GTG GAC TAC TGG GGT CAA GGG 480 
Phe Cys Ala Arg Met Ala Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly 

ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC 540 
Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

GGA TOG GAC ATC CAG ATG ACT CAG TCT CCA TCC TCC CTG TCT GCC TCT CTG GGA GAC AGA 600 
Gly Ser. Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly Asp Arg 

GTC ACC ATC AGT TGC AGG GCA AGT CAG GAC ATT AGC AAT TAT TTA AAC TOG TAT CAG CAG 660 
Val Thr He Ser Cys Arg Ala Ser Gin Asp He Ser Asn Tyr Leu Asn Trp Tyr Gin Gin 

ACA CCA GAT GGA ACT GTT AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA GTC 720 
Thr Pro Asp Gly Thr Val Lys Leu Leu He Tyr Tyr Thr Ser Arg Leu His Ser Gly Val 

CCA TCA AGG TTC AGT GGC AGT GGG TCT GGA ACA GAT TAT TCT CTC ACC ATT AGC AAC CTG 780 
Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr He Ser Asn Leu 

GAG CAA GAA GAT ATT GCC ACT TAC TTT TGC CAA CAG GGT TAT ACG CTT CCG TGG ACG TTC 840 
Glu Gin Glu Asp He Ala Thr Tyr Phe Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Phe 

GGT GGA GGC ACC AAG CTG GAG ATC AAA CGG GOG GCC GCA ATC GAG GGA AGG AGT CTG CAA 900 
Gly Gly Gly Thr Lys Leu Glu He Lys Arg Ala Ala Ala He Glu Gly Arg Ser Leu Gin 

AAT AAG AAC CCC CAC CAG CCC ATG ACC CTC ACT TGG CAG GTA CTG TCC CAA ACT GGA GAC 960 
Asn Lys Asn Pro His Gin Pro Met Thr Leu Thr Trp Gin Val Leu Ser Gin Thr Gly Asp 

GTT GTC TGG GAT ACA AAG GCA GTC CAG CCC CCT TGG ACT TGG TGG CCC ACA CTT AAA CCT 1020 
Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro Trp Thr Trp Trp Pro Thr Leu Lys Pro 

GAT GTA TGT GCC TTG GOG GCT AGT CTT GAG TCC TGG GAT ATC CCG GGA ACC (SAT GTC TCG 1080 
Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser Trp Asp He Pro Gly Thr Asp Val Ser 
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34.1-GALV-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ICC TCT AAA CGA GTC AGA COT CCG GAC TCA GAC TAT ACT GCC GCT TAT AAG CAA ATC ACC 1140 
Ser Ser Lys Arg Val Arg Pro Pro Asp Ser Asp Tyr Thr Ala Ala Tyr Lys Gin He Thr 

»# 

TOG GGA GCC ATA GGG TGC AGC TAC CCT CQG GCT AGG ACT AGA ATG GCA AGO TCT ACC TTC 1200 
Trp Gly Ala lie Gly Cys Ser Tyr Pro Arg Ala Arg Thr Arg Met Ala Ser Ser Thr Phe 

TAC GTA TGT CCC CGG GAT GGC CGG'ACC CTT TCA GAA GCT AGA AGG TGC GGG GGG CTA GAA 1260 
Tyr Val Cys Pro Arg Asp Gly Arg Thr Leu Ser Glu Ala Arg Arg Cys Gly Gly Leu Glu 

TCC CTA TAC TGT AAA GAA TOG GAT TGT GAG ACC ACG GGG ACC GGT TAT TOG CTA TCT AAA 1320 
Ser Leu Tyr Cys Lys Glu Trp Asp Cys Glu Thr Thr Gly Thr Gly Tyr Trp Leu Ser Lys 

TCC TCA AAA GAC CTC ATA ACT GTA AAA TGG GAC CAA AAT AGC GAA TGG ACT CAA AAA TTT 1380 
Ser Ser Lys Asp Leu lie Thr Val Lys Trp Asp Gin Asn Ser Glu Trp Thr Gin Lys Phe 

CAA CAG TGT CAC CAG ACC GGC TGG TCT AAC CCC CTT AAA ATA GAT TTC ACA GAC AAA GGA 1440 
Gin Gin Cys His- Gin Thr Gly Trp Cys Asn Pro Leu Lys He Asp Phe Thr Asp Lys Gly 

AAA TTA TCC AAG GAC TGG ATA ACG GGA AAA ACC TGG GGA TTA AGA TTC TAT GTG TCT GGA 1500 
Lys Leu Ser Lys Asp Trp lie Thr Gly Lys Tnr Trp Gly Leu Arg Phe Tyr Val Ser Gly 

CAT CCA GGC GTA CAG TTC ACC ATT CGC TTA AAA ATC ACC AAC ATG CCA GCT GTG GCA Gift. 1560 
His Pro Gly Val Gin Phe Thx Xle Arg Leu Lys He Thr Asn Met Pro Ala Val Ala Val 

GGG ATC CAA TAC CCC TAC GAC GTC CCC GAC TAT GCC AAT CGA TEA GTT CAA TTT GTT AAA 1620 
Gly He Gin. Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Asn Arg Leu Val Gin Phe Val Lys 

GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG CTA AAA CCA 1680 
Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro 

CTA GAA TAC GAG CCA TGA l698 
Leu Glu Tyr Glu Pro - . . 
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AdelH Sequence 



10 20 30 40 
123 4b6 789 012 345 678 901 234 567 890 123 456 789 012 


345 


678 


50 
901 


234 


567 


60 
890 




ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys He 


AAC 
Asn 


CCG 
Pro 


TGG 
Trp 


AAG 
Lys 


CCC 
Pro 


TTA 
Leu 


60 


ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC 
He Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser 


CCC 
Pro 


CAG 
Gin 


GTC 
Val 


TTT 
Phe 


AAT 
Asn 


GTA 
Val 


120 


ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn 


GCC 
Ala 


ACC 
Thr 


TCC 
Set 


CTC 
Leu 


CTG 
Leu 


GGA 
Glv 


180 


ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys 


GAT 
ASD 


CTG 
Leu 


GTC 
Val 


GGA 
Glv 


GAG 
Glu 


GAG 
Glu 


240 


TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys 


AAG 
Lys 


TAC 
iyr 


GCC 


GCA 

Ala 


GGG 
vjiy 


AGA 
Arg 


300 


CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His 


ACC 
Thr 


GTA 


AAG 
Lys 


TCG 
Ser 


GGG 
Gly 


TGT 
Cys 


360 


GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu 


ACC 
Thr 


ACC 
Thr 


GGA 

mm 

Gly 


CAG 
Gin 


GCT 
Ala 


TAC 

Tyr 


420 


TGG AAG CCC ACA TCA TOG TGG GAC CTA .ATC TCC CTT AAG CGC 
Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu Lys Arg 


GGT 

vj-Ly 


AAC 
Asn 


ACC 

Thr 


CCC 
Pro 


TGG 
Trp 


GAC 
Asp 


480 


ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC 
Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu 


TCC 

Cor 


AAA 
Lys 


GTA 
val 


TCC 
Ser 


AAT 
Asn 


TCC 
Ser 


540 


TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val 


CTA 


GAA 


TTC 
r*ne 


ACT 
Thr 


GAT 
Asp 


GCA 
Ala 


600 


GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TOG TGG GGA CTG 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu 


AGA 

A TTT 


CTG 
Leu 


TAC 
Tyr 


CGG 
Arg 


ACA 
Thr 


GGA 
Gly 


660 


ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT 
Thr Asp Pro He Thr Met Phe Ser Leu Thr Arg Gin Val Leu 


AAT 
Asn 


GTG 

VCLJL 


GGA 


CCC 

rXO 


CGA 
Arg 


GTC 
vai 


720 


CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT 
Pro He Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro 


TCC 
Ser 


TCA 
Ser 


CCA 
Pro 


ATA 
He 


GAG 
Glu 


ATT 
He 


780 


GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC AGT TAC 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr 


CCC 
Pro 


CCT 
Pro 


TCC 
Ser 


ACT 
Thr 


ACC 
Thr 


AGT 
Ser 


840 


ACA CCC TCA ACC TCC CCT ACA AGT CCA AGT GTC CCA CAG CCA 
Thr Pro Ser Thr Ser Pro Thr Ser Pro Ser Val Pro Gin Pro 


CCC 
Pro 


CCA 
Pro 


GGA 
Gly 


ACT 
Thr 


GGA 

Gly 


GAT 
Asp 


900 


AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT CAG GCG CTT AAC 
Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala Leu Asn 


CTC 
Leu 


ACC 
Thr 


AAT 
Asn 


CCC 
Fro 


GAC 
Asp 


AAG 
Lys 


960 


ACC CAA GAA TGT TGG CTG TGC TTA GTG TCG GGA CCT CCT TAT 
Thr Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro Pro Tyr 


TAC 
Tyr 


GAA 
Glu 


GGA 
Gly 


GTA 
Val 


GCG 
Ala 


GTC 
Val 


1020 


GTG GGC. ACT TAT ACC AAT CAT TCC ACC GCT CCG GQC AAC TGT 
Val Gly Thr Tyr Thr Asn His Ser Thr Ala Pro Ala Asn Cys 


ACG 
Thr 


GCC 
Ala 


ACT 
Thr 


TCC 
Ser 


CAA 
Gin 


CAT 
His 


1080 


AAG CTT ACC CTA TCT GAA GTG ACA* GGA CAG GGC CTA TGC ATG 
Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu Cys Met 


GGG 
Gly 


GCA 
Ala 


GTA 
Val 


CCT 
Pro 


AAA 
Lys 


ACT 
Thr 


1140 


CAC CAG GCC TTA TGT AAC ACC ACC CAA AGC GCC GGC TCA GGA 
His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly Ser Gly 


TCC 
Ser 


TAC 
Tyr 


TAC 
Tyr 


CTT 
Leu 


GCA 
Ala 


GCA 
Ala 


1200 
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CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTG ACT CCC TCC TTG TCC ACC ACG GTG 1260 
Pro Ala Gly Thr Met Trp Ala Cys Sex Thr Gly Leu Thr Pro Cys Leu Sex Thr Thr Val 

CTC AAT CTA ACC ACA GAT TAT TCT GTA TTA GTT GAA CTC TGG CCC AGA G#A ATT TAC CAC 1320 
Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val lie Tyr His 

TCC CCC GAT TAT ATG TAT GGT CAG CTT GAA CAG CGT ACC AAA TAT AAA AGA GAG CCA GTA 1380 
Ser Pro Asp Tyr Met Tyr Gly Gin Leu Glu Gin Arg Thr Lys Tyr Lys Arg Glu Pro Val 

.TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC ATG GGA GGG ATT GCA GCT GGA ATA 1440 
Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly lie Ala Ala Gly lie 

GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC CAG CAG TTT GAG CAG CTT CAT GCC GCT ATC 1500 
Gly Thr Gly Thr Thr Ala Leu He Lys Thr Gin Gin Phe Glu Gin Leu His Ala Ala He 

CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC AAC CTA GAA AAG TCA CTG ACC TOG 1560 
Gin Thr Asp Leu Asn Glu. Val Glu Lys Ser He Thr Asn Leu Glu Lys Ser lieu Thr Ser 

TTG TCT GAA GTA GTC CTA CAG AAC CGC AGA GGC CTA GAT TTG CTA TTC CTA AAG GAG GGA 1620 
Leu Ser. Glu Val Val Leu Gin Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly 

GST CTC TGC GCA GCC CTA AAA GAA GAA TGT TGT TTT TAT GCA GAC CAC ACG GGG CTA GTG 1680 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TTA ASA GAA AGG CTT AAT CAG AGA CAA AAA CTA TTT GAG ACA 1740 
-Arg Asp Ser Met .Ala Lys Leu Arg Glu "Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Thr 

GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC CCC TGG TTT ACC ACC TTA ATC TCC 1800 
Gly Gin Gly Trp Che Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He Ser 

ACC ATC ATG GGA CCT CTA ATA CTA CTC TTA CTG ATC TTA CTC TTT GGA CCT TGC ATT CTC 1860 
Thr He Met Gly Pro Leu He Val Leu Leu Leu He Leu Leu Phe Gly Pro Cys He Leu 

AAT GGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT 1920 
Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr 

CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 1959 
Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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50 
901 


234 


567 


60 
890 




ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asn Pro 


TGG 
Trp 


AAG 
Lys 


CCC 
Pro 


TTA 
Leu 


60 


ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin 


GTC 
Val 


TTT 
Phe 


AAT 
Asn 


GTA 
Val 


120 


ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr 


TCC 
Ser 


CTC 
Leu 


CTG 
Leu 


GGA 
Gly 


180 


ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TOT GAT CTC 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu 


GTC 
Val 


GGA 
Gly 


GAG 
Glu 


GAG 
Glu 


240 


TGG GAC CCT TCA GAC CAG GAA COG TAT GTC GGG TAT GGC TGC AAG TAG 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr 


CCC 
Pro 


GCA 
Ala 


GGG 
Gly 


AGA 
Arg 


300 


CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val 


AAG 
Lys 


TOG 
Ser 


GGG 
Gly 


TGT 
Cys 


360 


GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG ' GGG TGT GAA ACC ACC 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr 


GGA 
Gly 


CAG 
Gin 


GCT 
Ala 


TAC 
Tyr 


420 


TGG AAG CCC ACA TCA TOG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC 
Tri> Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn 


ACC 
Thr 


CCC 
Pro 


TGG 
Trp 


GAC 
Asp 


480 


ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC TCC AAA 
Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys 


GTA 
Val 


TCC 
Ser 


AAT 
Asn 


TCC 
Ser 


540 


TTC CAA GGG GOT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu 


TTC 
Phe 


ACT 
Thr 


GAT 
Asp 


GCA 
Ala 


600 


GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TOG TGG GGA CTG AGA CTC 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu 


TAC 
Tyr 


CGG 
Arg 


ACA 
Thr 


GGA 
Gly 


660 


ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GTG 
Thr Asp Pro He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val 


GGA 
Gly 


CCC 
Pro 


CGA 
Arg 


GTC 
Val 


720 


CCC ATA GGG CCC AAC CCC GTT CTG GCA GAC CAA CAG CCA CTC TCC AAG 
Pro He Gly Pro Asn Pro Val Leu Ala Asp Gin Gin Pro Leu Ser Lys 


CCC 
Pro 


AAA 

Lys 


CCT 
Pro 


GTT 
Val 


780 


AAG TCG CCT TCA GTC ACC AAA CCA CCC AGT GGG ACT CCT CTC TCC CCT 
Lys Ser Pro Ser Val Thr Lys Pro Pro Ser Gly Thr Pro Leu Ser Pro 


ACC 
Thr 


CAA 
Gin 


CTT 
Leu 


CCA 
Pro 


840 


CCG GCG GGT ACC GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT 
Pro Ala Gly Thr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr 


CAG 
Gin 


GCG 
Ala 


CTT 
Leu 


AAC 
Asn 


900 


CTC ACC AAT CCC GAC AAG ACC CAA GAA TGT TGG CTG TGC TTA GTC TCG 
Leu Thr Asn Pro Asp Lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ser 


GGA 
Gly 


CCT 
Pro 


CCT 
Pro 


TAT 
Tyr 


960 


TAC GAA GGA GTA GOG GTC GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT 
Tyr Glu Gly Val Ala Val Val Gly Thr Tyr Thr Asn His Ser Thr Ala 


CCG 
Pro 


GCC 
Ala 


AAC 
Asn 


TGT 
Cys 


1020 


ACG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG 
Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val Thr Gly Gin 


GGC 
Gly 


CTA 
Leu 


TGC 
Cys 


ATG 
Met 


1080 


CGG GCA GTA CCT AAA ACT CAC CAG GCC TTA TGT AAC ACC ACC CAA AGC 
Gly Ala Val Pro Lys Thr His Gin Ala Leu Cys Asn Thr Thr Gin Ser 


GCC 
Ala 


GGC 
Gly 


TCA 
Ser 


GGA 
Gly 


1140 


TCC TAC TAC CTT GCA GCA CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT 
Ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr Met Trp Ala Cys Ser Thr 


GGA 
Gly 


TTC 
Leu 


ACT 
Thr 


CCC 
Pro 


1200 
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TGC TTG TCC ACC ACG GTG CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA CTC TGG 1260 

Cys Leu Ser Thr Thr Val Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp 

CCC AGA GTA ATT TAC CAC TCC CCC GAT TAT ATG TAT GCT CAG CTT GAA CAG CGT ACC AAA 1320 
Pro Arg Val He Tyr His Ser Pro Asp Tyr Met Tyr Gly Gin Leu Glu Gin Arg Thr Lys 

TAT AAA AGA GAG CCA GTA TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC ATG GGA 1380 
Tyr Lys Arg Glu Pro Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly 

GGG ATT GCA GCT GGA ATA GOG ACG GGG ACC ACT GCC TTA AW -AAA ACC CAG CAG TTT GAG 1440 
Gly He Ala Ala Gly He Gly Thr Gly Thr Thr Ala Leu lie Lys Thr Gin Gin Phe Su 

CAG CTT CAT GCC GCT ATC CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC AAC CTA 1500 
Gin Leu His Ala Ala He Gin Thr Asp. Leu Asn Glu Val Glu Lys S He Sr 2 

GAA AAG TCA CTG ACC TCG TTG TCT GAA GTA GTC CTA CAG AAC CGC AGA GGC CTA GAT TTG 1560 
Glu Lys Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly Su 2p Su 

It^ CTA AAG GAG GGA GGT CTC TGC GCA GCC CTA AAA GAA GAA TGT TGT TTT TAT GCA 
Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala 

GGG CTA GTG AGA GAC AGC ATG GCC AAA TTA AGA GAA AGG CTT AAT CAG AGA 
Asp His Thr Gly Leu Val Arg Asp Ser Met Ala Lys Leu- Arg Glu Arg Leu Asn Gin Arg 

CAA AAA CTA TTT GAG ACA GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC CCC TGG 
Gin Lys Leu Phe Glu Thr Gly Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp 

w!' nfc^ TTA ATC TCC ACC ATC ATG GGA CCT CTA ATA GTA CTC TTA CTG ATC TTA CTC 

Phe Thr Thr Leu He Ser Thr He Met Gly Pro Leu He Val Leu Leu 2u 5 Eu" 2u 

TGC ATT CTC AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC 
Phe Gly Pro Cys He Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val 

CAG .GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 
Gin Ala Leu val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 



1620 
1680 
1740 
1800 
1860 
1917 
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ATG GTA TTG CTG CCT GGG TCC ATG CTT CTC ACC TCA AAC CTG CAC CAC CTT CGG CAC CAG 60 
Met Val Leu Leu Pro Gly Ser Met Leu Leu Thr Ser Asn Leu His His Leu Arg His Gin 

ATG AGT CCT GGG AGC TGG AAA AGA CTG ATC ATC CTC TTA AGC TGC GTA TTC GGC GGC GGC 120 
Met Ser Pro Gly Ser Trp Lys Arg Leu He He -Leu Leu Ser Cys Val Phe Gly Gly Gly 

GGG ACG AGT CTG CAG AAT AAG AAC CCC CAG CCC ATG ACC CTC ACT TGG CAG GTA CTG TCC 180 
Gly Thr Ser Leu Gin Asa .Lys Asn Pro Gin Pro Met Thr Leu Thr Trp G1A, Val Leu Ser 

CAA ACT GGA GAG GTT GTC TGG GAT ACA AAG GCA GTC CAG CCC CCT TGG ACT TGG TGG CCC 240 
Gin Thr Gly Asp Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro Trp Thr Trp Trp Pro 

ACA CTT AAA CCT GAT GTA TGT GCC TTG GOG GCT AGT CTT GAG TCC TGG GAT ATC CCG GGA 300 
Thr Leu Lys Pro Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser Trp Asp He Pro Gly 

ACC GAT GTC TOG TCC TCT AAA GGA GTC AGA CCT CCG GAC TCA GAC TAT ACT GCC GCT TAT 360 
Thr Asp Val Ser Ser Ser Lys Arg Val Arg Pro Pro Asp Ser Asp Tyr Thr Ala Ala Tyr 

AAG CAA ATC ACC TGG GGA GCC ATA GGG TGC AGC TAG CCT CGG GCT AGG ACT AGA ATG GCA 420 
Lys Gin He Thr Trp Gly Ala He Gly Cys Sex Tyr Pro Arg Ala Arg Thr Arg Met Ala 

AGC TCT ACC TTC TAC GTA TGT CCC CGG GAT GGC CGG ACC CTT TCA GAA GCT AGA AGG TGC 480 
Ser Ser Thr Phe Tyr Val Cys Pro Arg Asp Gly Arg Thr Leu Ser Glu Ala Arg Arg Cys 

GGG GGG CTA GAA TCC CTA TAC TGT AAA GAA TGG GAT TGT GAG ACC ACG GGG .ACC GGT TAT 540 
Gly Gly Leu Glu Ser Leu Tyr Cys Lys Glu Trp Asp Cys Glu Thr Thr Gly Hut Gly Tyr 

TGG CTA TCT AAA TCC TCA AAA GAC CTC ATA ACT GTA AAA TGG GAC CAA AAT AGC GAA TGG 600 
Trp Leu Ser Lys Ser Ser Lys Asp Leu He Thr Val Lys Trp Asp Gin Asn Ser Glu Trp 

ACT CAA AAA TTT CAA CAG TGT CAC CAG ACC GGC TGG TGT AAC CCC CTT AAA ATA GAT TTC 660 
Thr Gin Lys Phe Gin Gin cys His Gin Thr Gly Trp Cys Asn Pro Leu Lys He Asp Phe 

ACA GAC AAA GGA AAA TTA TCC AAG GAC TOG ATA ACG GGA AAA ACC TGG GGA TTA AGA TTC 720 
Thr Asp Lys Gly Lys Leu Ser Lys Asp Trp He Thr Gly Lys Thr Trp Gly Leu Arg Phe 

TAT GTG TCT GGA CAT CCA GGC GTA CAG TTC ACC ATT CGC TEA AAA ATC ACC AAC ATG CCA 780. 
Tyr Val Ser Gly His Pro Gly Val Gin Phe Thr He Arg Leu Lys He Thr Asn Met Pro 

GCT GTG GCA GTA GGT CCT GAC CTC GTC CTT GTG GAA CAA GGA CCT OCT AGA ACG TCC CTC 840 
Ala Val Ala Val Gly Pro Asp Leu Val Leu Val Glu Gin Gly Pro Pro Arg Thr Ser Leu 

GCT CTC CCA CCT CCT CTT CCC CCA AGG GAA GCG CCA CCG CCA TCT CTC CCC GAC TCT AAC 900 
Ala Leu Pro Pro Pro Leu Pro Pro Arg Glu- Ala Pro Pro Pro Ser Leu Pro Asp Ser Asn 

TCC ACA GCC CTG GCG ACT AGT GCA CAA ACT 4 - CCC ACG GTG AGA AAA ACA ATT GTT ACC CTA 960 
Ser Thr Ala Leu Ala Thr Ser Ala Gin Thr Pro Thr Val Arg Lys Thr lie Val Thr Leu 

AAC ACT CCG CCT CCC ACC ACA GGC GAC AGA CTT TTT GAT CTT GTG CAG GGG GCC TTC CTA 1020 
Asn Thr Pro Pro Pro Thr Thr Gly Asp Arg Leu Phe Asp Leu Val Gin Gly Ala Phe Leu 

ACC TTA AAT GCT ACC AAC CCA GGG GCC ACT GAG TCT TGC TGG CTT TGT TTG GCC ATG GGC 1080 
Thr Leu Asn Ala Thr Asn Pro Gly Ala Thr Glu Ser Cys Trp Leu Cys Leu Ala Met Gly 

CCC CCT TAT TAT GAA GCA ATA GCC TCA TCA GGA GAG GTC GCC TAC TCC ACC GAC CTT GAC 1140 
Pro Pro Tyr Tyr Glu Ala lie Ala Ser Ser Gly Glu Val Ala Tyr Ser Thr Asp Leu Asp 

CGG TGC CGC TGG GGG ACC CAA GGA AAG CTC ACC CTC ACT GAG GTC TCA GGA CAC GGG TTG 1200 
Arg Cys Arg Trp Gly Thr Gin Gly Lys Leu Thr Leu Thr Glu Val Ser Gly His Gly Leu 
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CTC 
Leu 


TGC 
Cys 


AAT 
Asn 


CAG ACC CTA TCC ATC AAT 
Gin Thr Leu Ser He Asn 


1260 


AAC 
Asn 


CAT 
His 


AGC 
Ser 


TGG TGG GCT, TGC AGC ACT 
Trp Trp Ala*Cys Ser Thr 


1320 


AAT 
Asn 


CAG 
Gin 


ACT 
Thr 


AGA GAT TTC TGT ATC! CAG 
Arg Asp Phe Cys lie Gin 


1380 


GAA 
Glu 


GAA 
Glu 


GTT 
Val 


TT*A f*fcf2 fYY"* rnjirp r??^/^ 

Leu Leu Gin Ala Tyr Asp 




Ser 


CTT 

Leu 


Thr 


CTA GCT GTT TTA CTG GGG 
Leu Ala Val Leu Leu Gly 


JLDUU 


— X 

Thr 


Ala 


TTA 
Leu 


All AAA QGA OCT ATA GAC 

He Lys Gly Pro He Asp 


JLbou 


He 


Asp 


GCT 
Ala 


GAC CTC CGG GCC CTC CAA 
Asp Leu Arg Ala Leu Gin 


1620 


Ser 


CTG 
Leu 


Iv-C 

Ser 


CaAG GTA GTG CTC CAA AAT 

Glu Val Val Leu Gin Asn 


1680 


Gly 


Gly 


Leu 


1\*T GCG GCC CTA AAG GAA 

Cys Ala Ala Leu Lys Glu 


1 *T A ft 

1740 


GTA 
Val 


CGG 
Arg 


GAC 
Asp 


TCC ATG AAA AAA CTC AAA 
Ser Met Lys Lys Leu Lys 


1800 


AAA 
Lys 


AGC 
Ser 


CAA 
Gin 


AAC TGG TAT GAA GGA TOG 
Asn Trp Tyr Glu Gly Trp 


1860 


TCA 
Ser 


ACC 
Thr 


ATC 
lie 


GCT GGG CCC CTA TTA CTC 
Ala Gly Pro Leu Leu Leu 


1920 


atc 
He 


AAT 
Asn 


CGA 
Arg 


TTA GTT CAA TTT GTT AAA 
Leu Val Gin Phe Val Lys 


1980 


ACT 
Thr 


CAA 
Gin 


CAA 
Gin 


TAC CAC CAG CTA AAG CCT 
Tyr His Gin Leu Lys Pro 


2040 



ATA GAG TAC GAG CCA 2055 
He Glu Tyr Glu Pro 
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ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TOG AAG CCC TTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA 120 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TC£ CTC CTG GGA 180 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 240 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG AGA 300 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC OCT GGG CAT ACC GTA AAG TOG GGG TGT 360 
Gin Arg Thr Arg ' Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAC 420 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC CCC TGG GAC 480 
Trp Lys Pro Thr . Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAT TCC 540 
Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys . Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCA 600 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG ACA GGA 660 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTC 720 
Thr Asp Pro He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC OCT TCC TCA CCA ATA GAG ATT 780: 
Pro He Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu lie 

GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC AGT 840 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 

ACA CCC CCC CCA GGT ACC GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT CAG GCG 900 
Thr Pro Pro Pro Gly Thr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala 

CTT AAC CTC ACC AAT CCC GAC AAG ACC CAA GAA TGT TGG CTG TGC TTA GTG TCG GGA CCT 960 
Leu Asn Leu Thr Asn Pro Asp Lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro 

CCT TAT TAC GAA GGA GTA GCG GTC GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT CCG GCC 1020 
Pro Tyr Tyr Glu Gly Val Ala Val Val Gly Thr Tyr Thr Asn His Ser Thr Ala Pro Ala 

AAC TGT ACG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA 1080 
Asn Cys Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu 

TGC ATG GGG GCA GTA CCT AAA ACT CAC CAG GCC TTA TGT AAC ACC ACC CAA AGC GCC GGC 1140 
Cys Met Gly Ala Val Pro Lys Thr His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly . 

TCA GGA TCC TAC TAC CTT GCA GCA CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTG 1200 
Ser Gly Ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr -Met Trp Ala Cys Ser Thr Gly Leu 
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C2delH Sequence 

10 20 30 40 ~~ ~ 50 60 



123 456 789 012 345 678 901 234 567 890 123 7RQ 


012 


345 


678 


901 


234 


567 


890 




ACT CCC TGC TTG TCC ACC ACG GTG CTC AAT CTA AOC ACA 
Thr Pro Cys Leu Ser Thr Thr Val Leu Asn Leu Thr Thr 


GAT 
Asp 


TAT 
Tyr 


TGT 
Cys 


GTA 


TTA 
Leu 


GTT 
Val 


GAA 
Glu 


1260 


CTC TGG CCC AGA GTA ATT TAG CAC TCC CCC GAT TAT ATC 
Leu Trp Pro Arg Val He Tyr His Ser Pro Asp Tyr Met 


TAT 
Tyr 


GGT 
Gly 


CAG 
Gin 


CTT 
lieu 


GAA 
Glu 


CAG 
Gin 


CGT 
Arc 


1320 


ACC AAA TAT AAA AGA GAG CCA GTA TCA TTG ACC JCTC GCC 
Thr Lys Tyr Lys Arg Glu Pro Val Ser Leu Thr Leu Ala 


CTT 
Leu 


CTA 
Leu 


CTA 
Leu 


GGA 
Glv 


GGA 


TTA 

T mi 
licU 


ACC 

XXiX 


1380 


ATG GGA GGG ATT GCA GCT GGA ATA GGG ACG GOG ACC ACT 
Met Gly Gly lie Ala Ala Gly He Gly Thr Gly Thr Thr 


GCC 
Ala 


TTA 
Leu 


ATT 

Tl t* 

lie 


AAA 

Lys 


ACC 
Thr 


CAG 

-GJJQ. 


CAG 
\31X1 


1440 


TTT GAG CAG CTT CAT GCC GCT ATC GAG ACA GAC CTC AAC 
Phe Glu Gin Leu His Ala Ala He Gin Thr Asp Leu Asn 


GAA 
uxu 


GTC 

Vol 


GAA 

IxJLU 


AAG 

Lys 


TCA 
Ser 


ATT 
lie 


ACC 
Thr 


1500 


AAC CTA GAA AAG TCA CTG ACC TOG TTG .TCT GAA GTA GTC 
Asn Leu Glu Lys Ser Leu Thr Ser Leu Ser Glu Val Val 


CTA 
Leu 


CAG 
bin 


AAC 
Asn 


CGC 
Arg 


AGA 
Arg 


GGC 
Gly 


CTA 
Leu 


1560 


GAT TTG CTA TTC CTA AAG GAG GGA GGT CTC TGC GCA GCC 
nz*y j_rt:u. i-rt=u riic **=u uya «Jlu \jJLy wxy L6U Cys Ala Ala 


CTA 
Leu 


AAA 
Lys 


GAA 
Glu 


GAA 
Glu 


TGT 

cys 


TGT 
Qys 


TTT 

Phe 


1620 


"CAT GCA GAC CAC ACG GGG CTA GTG AGA GAC AGC ATG GCC 
Tyr Ala Asp His Thr Gly Leu Val Arg Asp Ser Met Ala 


AAA 
Lys 


TTA 
Leu 


AGA 
Arg 


GAA 
Glu 


AGO 
Arg 


CTT 
Leu 


AAT 
Asn 


1680 


CAG AGA CAA AAA CTA TTT GAG ACA GGC CAA GGA TGG TTC 
Gin Arg Gin Lys Leu Phe Glu Thr Gly Gin Gly Trp Phe 


GAA 
Glu 


GGG 
Gly 


CTG 
Leu 


TTT 
Phe 


AAT 
Asn 


AGA 
Arg 


TCC 
Ser 


1740 


CCC TGG TTT ACC ACC TTA ATC TCC ACC ATC ATG GGA CCT 
Pro Trp Phe Thr Thr lieu He Ser Thr He Met Gly Pro 


CTA 
Leu 


ATA 
He 


GTA 
Val 


CTC 
Leu 


TTA 
Leu 


CTG 
Leu 


ATC 
He 


1800 


TTA CTC TTT GGA CCT TGC ATT CTC AAT GGA TTG GTC CAA 
Leu Leu Phe Gly Pro Cys He Leu Asn Arg Leu Val Gin 


TTT 

Phe 


GTT 
Val 


AAA 

Lys 


GAC 
Asp 


AGG 
Arg 


ATC 
He 


TCA 

Ser 


1860 


GTG GTC CAG GCT CTG GTT TTG ACT CAG CAA TAT CAC CAG 
Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin 


CTA 
Leu 


AAA 
Lys 


CCC 
Pro 


ATA 
He 


GAG 
Glu 


TAC 
Tyr 


GAG 
Glu 


1920 



CCA TGA 
Pro . . . 



FIG. 18 (continued) 
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ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys He Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG GTC TTT .AAT GTA 120 
Xle Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA 180 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 240 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG AGA 300 
Tip Asp Pro Sex Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC COG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TOG GGG TGT 360 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 



TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAC 420 
Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC CCC TGG GAC 480 
Asp i^u lie Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

TGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAT TCC 540 
Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser 

GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCA 600 
Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG AGA GGA 660 
Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

TCC CTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTC 720 
Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 780 
Leu Pr-o Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

AGC CCC CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC AGT 840 
Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 

ACA AGT CCA AGT GTC CCA CAG CCA CCC CCA GGA ACT GGA GAT 900 
Thr Ser Pro Ser Val Pro Gin Pro Pro Pro Gly Thr Gly Asp 

GGA GCC TAT CAG GCG CTT AAC CTC ACC AAT CCC GAC AAG 960 
Gly Ala Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys 

TTA GTG TCG GGA CCT CCT TAT TAC GAA GGA GTA GCG GTC 1020 
Leu Val Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 

TCC ACC GCT CCG GCC AAC TGT ACG GCC ACT TCC CAA CAT 1080 
Ser Thr Ala Pro Ala Asn Cys Thr Ala Thr Ser Gin His 



GGG GGA CCA GGA GAG GGC TAC 
Gly Gly Pro Gly Glu Gly Tyr 

TGG AAG CCC ACA TCA TCG TGG 
Trp Lys Pro Thr Ser Ser Trp 

ACG GGA TGC TCT AAA GTT GCC 
Thr Gly Cys Ser Lys Val Ala 

TTC CAA GGG GCT ACT CGA GGG 
Phe Gin Gly Ala Thr Arg Gly 

GGA AAA AAG GCT AAC TGG GAC 
Gly Lys Lys Ala Asn Trp Asp 

ACA GAT CCT ATT ACC ATG TTC 
Thr Asp Pro lie Thr Met Phe 

CCC ATA GGG CCC AAC CCA GTA 
Pro He Gly Pro Asn Pro Val 

GTA CCG GCT CCA CAG CCA CCT 
Val Pro Ala Pro Gin Pro Pro 

ACA CCC TCA ACC TCC CCT 
Thr Pro Ser Thr Ser Pro 

AGA CTA CTA GCT CTA GTC AAA 
Arg Leu Leu Ala Leu Val Lys 

ACC CAA GAA TGT TGG CTG TGC 
Thr Gin Glu Cys Trp Leu Cys 

GTG GGC ACT TAT ACC AAT CAT 
Val Gly Thr Tyr Thr Asn His 

AAG CTT ACC CTA TCT GAA GTG 
Lys Leu Thr Leu Ser Glu Val 

CAC CAG GCC TTA TGT AAC ACC 
His Gin Ala Leu Cys Asn Thr 



ACA GGA CAG GGC CTA TGC ATG 
Thr Gly Gin Gly Leu Cys Met 

ACC CAA AGC GCC GGC TCA GGA 
Thr Gin Ser Ala Gly Ser Gly 



GGG GCA GTA CCT AAA ACT 1140 • 
Gly Ala Val Pro Lys Thr 

TCC TAC TAC CTT GCA GCA 1200 
Ser Tyr Tyr Leu Ala Ala 



FIG. 19 
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AdeiHRIess Sequence 



10 20 30 40 50 60 



123 456 789 012 345 678 901 234 567 890 123 456 


789 


012 


345 


678 


901 


234 


567 


890 




CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTG 
Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu 


ACT 
Thr 


CCC 
Pro 


TGC 
Cys 


TTG 
Leu 


TCC, ACC 
Ser*"Thr 


ACG 
Thr 


GTG 
Val 


1260 


CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA 
Leu Asn Leu Thr Thr Asp iyr Cys Val Leu Val Glu 


CTC 
Leu 


TGG 
Trp 


CCC 
Pro 


AGA 
Arg 


GTA 
Val 


ATT 
He 


TAC 
Tyr 


CAC 
His 


1320 


TCC CCC GAT TAT ATG TAT GGT CAG CTT GAA CAG CGT 
Ser Pro Asp Tyr Met Tyr Gly Gin Leu Glu Gin Arg 


ACC 
Thr 


AAA 
Lys 


TAT 
Tyr 


AAA 
Lys 


AGA 
Arg 


GAG 
Glu 


CCA 
Pro 


GTA 
Val 


1380 


TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC. 
Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr 


ATG 
Met 


GGA 
Gly 


GGG 
Gly 


ATT 
He 


GCA 
Ala 


GCT 
Ala 


GGA 
Gly 


ATA 
He 


1440 


GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC CAG CAG 
Gly Thr Gly Thr Thr Ala Leu He Lys Thr Gin Gin 


TTT 
Phe 


GAG 
Glu 


CAG 
Gin 


CTT 
Leu 


CAT 
His 


GCC 
Ala 


GCT 
Ala 


ATC 
He 


1500 


CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC 
Gin Thr Asp Leu Asn Glu Val Glu Lys Ser He Thr 


AAC 
Asn 


CTA 
Leu 


GAA 
Glu 


AAG 
Lys 


TCA 
Ser 


CTG 
Leu 


ACC 
Thr 


TOG 
Ser 




*J.v» KaAA I? .LA. uTC wiA CAG AAC CGC AGA GGC CTA 
Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly Leu 


GAT 
Asp 


TTG 
Leu 


CTA 
Leu 


TTC 
Phe 


CTA 
Leu 


AAG 
Lys 


GAG GGA 
Glu Gly 


1620 


GGT CTC TGC GCA GCC CTA AAA GAA GAA TGT TGT TTT 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe 


TAT 
Tyr 


GCA 
Ala 


GAC 
Asp 


CAC 
His 


ACG 
Thr 


GGG 
Gly 


CTA 
Leu 


GTG 
Val 


1680 


AGA GAC AGC ATG GCC AAA TTA AGA GAA AGG CTT AAT 
Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn 


CAG 
Gin 


AGA 
Arg 


CAA 
Gin 


AAA 
Lys 


CTA 
Leu 


TTT 

Phe 


GAG 
Glu 


ACA 
Thr 


1740 


GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC 
Gly Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser 


CCC 
Pro 


TGG 
Trp 


TTT 

Phe 


ACC 
Thr 


ACC 
Thr 


TTA 
Leu 


ATC 
He 


TCC 
Ser 


1800 


ACC ATC ATG GGA CCT CTA ATA GTA CTC TTA CTG ATC 
Thx Xle Met Gly Pro Leu He Val Leu Leu Leu He 


TTA 
Leu 


CTC 
Leu 


TTT 
Phe 


GGA 
Gly 


CCT 
Pro 


TGC 
Cys 


ATT 
He 


CTC 
Leu 


I860! 


AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA 
Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser 


GTA 
Val 


GTC 
Val 


CAG 
Gin 


GCT 
Ala 


TEA 
Leu 








1911 
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PROMOdelHRiess Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 



ATT 
lie 


AAC 
Asn 


CCG 
Pro 


TGG AAG 
Trp Lys 


CCC TTA 
Pro Leu 


60 


AGC 
Ser 


CCC 
Pro 


CAG 
Gin 


GTC TTT 
Val Phe 


AAT GTA 
Asn Val 


120 


AAT 
Asn 


GCC 
Ala 


ACC 
Thr 


TCC CTC 
Ser "Leu 


CTG GGA 
Leu Gly 


180 


TGT 
cys 


GAT 
Asp 


CTG 
Leu 


GTC GGA 
Val Gly 


GAG GAG 
Glu Glu 


240 


TGC 

cys 


AAG 
Lys 


TAG 
Tyr 


CCC GCA 
Pro Ala 


GGG AGA 
Gly Arg 


300 


CAT 

His 


ACC 
Thr 


GTA 
Val 


AAG TCG 
Lys Ser 


GGG TGT 
Gly Cys 


360 


GAA 
Glu 


ACC 
Thr 


ACC 
Thr 


GGA CAG 
Gly Gin 


GCT TAG 
Ala Tyr 


420 


CGC 
Arg 


GGT 
Gly 


AAC 
Asn 


ACC CCC 
Thr Pro 


TGG GAC 
Trp Asp 


480 


CTC 
Leu 


TCC 
Ser 


AAA 
Lys 


GTA TCC 
Val Ser 


AAT TCC 
Asn Ser 


540 


GTC 
Val 


CTA 
Leu 


GAA 
Glu 


TTC ACT 
Phe Thr 


GAT GCA 
Asp Ala 


600 


CTG 
Leu 


AGA 
Arg 


CTG 
Leu 


TAG COG 
Tyr Arg 


ACA GGA 
Thr Gly 


660 


CTT 
Leu 


AAT 
Asn 


GTG 
Val 


GGA CCC 
Gly Pro 


CGA GTC 
Arg Val 


720 


CTC 
Leu 


TCC 
Ser 


AAG 

kys 


CCC AAA 
Pro Lys 


CCT GTT 
Pro Val 


780 


CTC 
Leu 


TCC 
Ser 


OCT 
Pro 


ACC CAA 
Thr Gin 


CTT CCA 
Leu Pro 


840 


GGA 
Gly 


GCC 
Ala 


TAT 
Tyr 


CAG GOG 
Gin Ala 


CTT AAC 
Leu Asn 


900 


TTA 
Leu 


GTG 
Val 


TCG 
Ser 


GGA CCT 
Gly Pro 


CCT TAT 
Pro Tyr 


960 


TCC 
Ser 


ACC 
Thr 


GCT 
Ala 


CCG GCC 
Pro Ala 


AAC TGT 
Asn Cys 


1020 


ACA 
Thr 


GGA 
Gly 


CAG 
Gin 


GGC CTA 
Gly Leu 


TGC ATG 
Cys Met 


1080 


ACC 
Thr 


CAA 
Gin 


AGC 
Ser 


GCC GGC 
Ala Gly 


TCA GGA 
Ser Gly 


1140 


TGC 
Cys 


AGC 
Ser 


ACT 
Thr 


GGA TTG 
Gly Leu 


ACT CCC 
Thr Pro 


1200 



FIG. 20 
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PROMOdelHRIess Sequence 



10 20 30 40 50 60 



123 456 789 012 345 678 901 234 567 890 123 456 


789 


012 


345 


678 


90L. 234 


567 


890 




. TGC TTG TCC ACC ACG GTG CTC AAT CTA ACC ACA GAT 
Cys Leu Ser Thr Thr Val Leu Asn Leu Thr Thr Asp 


TAT 
Tyr 


TGT 
Cys 


GTA 
Val 


TTA 
Leu 


GTT 
val 


GAA 
Glu 


CTC 
Leu 


TGG 
Trp 


1260 


CCC AGA GTA ATT TAG CAC TCC CCC GAT TAT ATG TAT 
Pro Arg Val lie Tyr His Ser Pro Asp Tyr Met Tyr 


GOT 
Gly 


CAG 
Gin 


CTT 
Leu 


GAA 
Glu 


CAG 
Gin 


CGT 
Arg 


ACC 
Thr 


AAA 
Lys 


1320 


TAT AAA AGA GAG CCA GTA TCA TTG ACC CTG GCC CTT 
Tyr Lys Arg Glu Pro Val Ser Leu Thr Leu Ala Leu 


CTA 
Leu 


CTA 
Leu 


GGA 
Gly 


GGA 
Gly 


TTA 
Leu 


ACC 
Thr 


ATG 

Met 


GGA 
Gly 


1380 


GGG ATT GCA GCT GGA ATA GGG ACG GGG ACC ACT GCC 
Gly lie Ala Ala Gly He Gly Thr Gly Thr Thr Ala 


TTA 
Leu 


ATT 
He 


AAA 
Lys 


ACC 
Thr 


CAG 
Gin 


CAG 
Gin 


TTT 
Phe 


GAG 
Glu 


1440 


CAG CTT CAT GCC GCT ATC CAG ACA GAC CTC AAC GAA 
Gin Leu His Ala Ala lie Gin Thr Asp Leu Asn Glu 


GTC 
Val 


GAA 
Glu 


AAG 
Lys 


TCA 
Ser 


ATT 
He 


ACC 
Thr 


AAC 
Asn 


CTA 
Leu 


1500 


GAA AAG TCA CTG ACC TOG TTG TCT GAA GTA GTC CTA 
Glu Lys. Ser Leu Thr Ser Leu Ser Glu Val Val Leu 


CAG 
Gin 


AAC 
Asn 


CGC 
Arg 


AGA 


Gly 


Leu 


GAT 
Asp 


TTG 
Leu 


1560 


CTA. TTC CTA AAG GAG GGA GCT CTC TGC GCA GCC CTA 
Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu 


AAA 
Lys 


GAA 
Glu 


GAA 
Glu 


TGT 

cys 


TGT 
Cys 


TTT 

Phe 


TAT 
Tyr 


GCA 
Ala 


1620 


GAC CAC ACG GGG CTA GTG AGA GAC AGC ATG GCC AAA 
Asp His Thr Gly Leu Val Arg Asp Ser Met Ala Lys 


TTA 
Leu 


AGA 
Arg 


GAA 
Glu 


AGG 
Arg 


CTT 
Leu 


AAT 
Asn 


CAG 
Gin 


AGA 
Arg 


1680 


CAA AAA CTA. TTT GAG ACA GGC CAA GGA TGG TTC GAA 
Gin Lys Leu Phe Glu Thr Gly Gin Gly Trp Phe Glu 


GGG 
Gly 


CTG 
Leu 


TTT 

Phe 


AAT 
Asn 


AGA 
Arg 


TCC 
Ser 


CCC 
Pro 


TGG 
Trp 


1740 


TTT ACC ACC TTA ATC TCC ACC ATC ATG GGA OCT CTA 
Phe Thr Thr Leu He Ser Thr He Met Gly Pro Leu 


ATA 
He 


GTA 
Val 


CTC 
Leu 


TTA 
Leu 


CTG 
Leu 


ATC 
lie 


TTA 
Leu 


CTC 
Leu 


1800 . 


TTT GGA CCT TGC ATT CTC AAT GGA TTA GTT CAA TTT 
Phie Gly Pro Cys He Leu Asn Arg Leu Val Gin Phe 


GTT 
Val 


AAA 
Lys 


GAC 
Asp 


AGG 
Arg 


ATC 
He 


TCA 
Ser 


GTA 
Val 


GTC 
Val 


1860 


CAG GCT TTA 
Gin Ala Leu 


















1869 



FIG. 20 (continued) 
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10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GTA TTG CTG CCT GGG TCC ATG CTT CTC ACC TCA AAC CTG CAC CAC CTT CGG CAC CAG 60 

Met Val Leu Leu Pro Gly Ser Met Leu Leu Thr Ser Asn Leu His His Leu Arg His Gin 



CTC 
Leu 


TTA 
Leu 


AGC 
Ser 


TGC 
Cys 


GTA TTC GGC GGC GGC 
Val Phe Gly Gly Gly 


120 


ATG 
Met 


ACC 
Thr 


CTC 
Leu 


ACT 
Thr 


TGG CAG'jGTA CTG TCC 
Trp Gin Val Leu Ser 


180 


GTC 
Val 


CAG 
Gin 


CCC 
Pro 


CCT 
Pro 


TGG ACT TGG TGG CCC 
Trp Thr Trp Trp Pro 


240 


AGT 
Ser 


CTT 
Leu 


GAG 
Glu 


TCC 
Ser 


TGG GAT ATC CCG GGA 
Trp Asp lie Pro Gly 


300 


CCG 
Pro 


GAC 
Asp 


TCA 
Ser 


GAC 
Asp 


TAT ACT GCC GCT TAT 
Tyr Thr Ala Ala Tyr 


360 


TAG 
?Yr 


CCT 
Pro 


CGG 
Arg 


GCT 
Ala 


AGG ACT AGA ATG GCA 
Arg Thr Arg Met Ala 


420 


CGG 
Arg 


ACC 
Thr 


CTT 
Leu 


TCA 
Ser 


GAA GCT AGA AGG TGC 
Glu Ala Arg Arg Cys 


480 


GAT 
Asp 


TGT 
Cys 


GAG 
Glu 


ACC 
Thr 


ACG GGG ACC GGT TAT 
Thr Gly Thr Gly Tyr 


540 


GTA 
Val 


AAA 
Lys 


TGG 
Trp 


GAC 
Asp 


CAA AAT AGC GAA TGG 
Gin Asn Ser Glu Trp 


600 


TGG 
Trp 


TGT 
Cys 


AAC 
Asn 


CCC 
Pro 


CTT AAA ATA GAT TTC 
Leu Lys He Asp Phe 


660 


ACG 
Thr 


GGA 
Gly 


AAA 
Lys 


ACC 
Thr 


TGG GGA TTA AGA TTC 
Trp Gly Leu Arg Phe 


720 


ATT 
He 


CGC 
Arg 


TTA 
Leu 


AAA 

Lys 


ATC ACC AAC ATG CCA 
He Tnr Asn Met Pro 


780 


GAA 
Glu 


CAA 
Gin 


GGA 
Gly 


CCT 
Pro 


CCT AGA ACG TCC CTC 
Pro Arg Thr Ser Leu 


840 


CCA 
Pro 


CCG 
Pro 


CCA 
Pro 


TCT 
Ser 


CTC CCC GAC TCT AAC 
Leu Pro Asp Ser Asn 


900 


ACG 
Thr 


GTG 
Val 


AGA 
Arg 


AAA 
Lys 


ACA ATT GTT ACC CTA 
Thr He Val Thr Leu 


960 


TTT 
Phe 


GAT 
Asp 


CTT 
Leu 


GTG 
Val 


CAG GGG GCC TTC CTA 
Gin Gly Ala Phe Leu 


1020 


TCT 
Ser 


TGC 
Cys 


TGG 
Trp 


CTT 
Leu 


TGT TTG GCC ATG GGC 
Cys Leu Ala Met Gly 


1080 


GAG 
Glu 


GTC 
Val 


GCC 
Ala 


TAC 
Tyr 


TCC ACC GAC CTT GAC 
Ser Thr Asp Leu Asp 


1140 


CTC 
Leu 


ACT 
Thr 


GAG 
Glu 


GTC 
Val 


TCA GGA CAC GGG TTG 
Ser Gly His Gly Leu 


1200 



FIG. 21 
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10 20 30 40 50 60 

123 456 7 89 012 345 678 901 2 34 567 890 123 456 789 012 345 678 901 234 567 890 
TGC ATA GGA AAG GTG CCC TTT ACC CAT CAG CAT CTC TGC AAT CAG ACC CTA TCC ATC AAT 
Cys He Gly Lys Val Pro Phe Thr His Gin His Leu Cys Asn Gin Thr Leu Ser lie Asn 



1260 



TCC TCC GGA GAC CAT CAG TAT CTG CTC CCC TCC AAC CAT AGC TGG TGG GCf vTOC AGC ACT 1320 
Ser Ser Gly Asp His Gin Tyr Leu Leu Pro Ser Asn His Ser Trp Trp Ala Cys Ser Thr 



1380 



1440 



1500 



GGC CTC ACC CCT TGC CTC TCC ACC TCA GTT TTT AAT CAG ACT AGA GAT TTC TOT ATC CAG 
Gly Leu Thr Pro Cys Leu Ser Thr Ser Val Phe Asn Gin Thr Arg Asp Phe Cys lie Gin 

GTC CAG CTG ATT CCT CGC ATC TAT TAC TAT CCT GAA GAA GTT TTG TTA CAG GOC TAT GAC 
Val Gin Leu He Pro Arg lie Tyr Tyr Tyr Pro Glu Glu Val Leu Leu Gin Ala Tyr Asp 

AAT TCT CAC CCC AGG ACT AAA AGA GAG GCT GTC TCA CTT ACC CTA GCT GTT TTA CTG GGG 
Asn Ser His Pro Arg Thr Lys Arg Glu Ala Val Ser Leu Thr Leu Ala Val Leu Leu Gly 

TTC GGA ATC ACG GCG GGA ATA GGT ACT GGT TCA ACT GCC TTA. ATT AAA GGA CCT ATA GAC 1560 
Leu Gly He Thr Ala Gly He Gly Thr Gly Ser Thr Ala Leu He Lys Gly Pro He Asp 

CTC CAG CAA GGC CTG ACA AGC CTC CAG ATC GCC ATA GAT GCT GAC CTC CGG GCC CTC CAA 
Leu Gin Gin Gly Leu Thr Ser Leu Gin He Ala He Asp Ala Asp Leu Arg Ala Leu Gin 

GAC TCA GTC AGC AAG TTA GAG GAC TCA CTG ACT TCC CTG TCC GAG GTA GTG CTC CAA AAT 
Asp Ser Val Ser Lys Leu Glu Asp Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gin Asn 

AGS AGA GGC CTT GAC TTG CTG TTT CTA AAA GAA GGT GGC CTC TGT GCG GCC CTA AAG GAA 
.Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu Lys Glu 

GAG TGC TGT TTT TAC ATA GAC CAC TCA GGT GCA GTA CGG GAC TCC ATG AAA AAA CTC AAA 
Glu Cys Cys Phe Tyr lie Asp His Ser Gly Ala Val Arg Asp Ser Met Lys Lys Leu Lys 

GAA AAA CTG GAT AAA AGA CAG TTA GAG CGC CAG AAA AGC CAA AAC TGG TAT GAA GGA TGG 
Glu Lys Leu Asp Lys Arg Gin Leu Glu Arg Gin Lys Ser Gin Asn Trp Tyr Glu Gly Trp 

TTC AAT AAC TCC OCT TGG TTC ACT ACC CTG CTA TCA ACC ATC GCT GGG OOC CTA TTA CTC 
Phe Asn Asn Ser Pro - Trp Phe Thr Thr Leu Leu Ser Thr He Ala Gly Pro Leu Leu Leu 

CTC CTT CTG TTG CTC ATC CTC GGG CCA TGC ATC ATC AAT AAG TTA GTT CAA TTC ATC AAT 1980 
Leu Leu Leu Leu Leu He Leu Gly Pro Cys He He Asn Lys Leu Val Gin Phe He Asn 

GAT AGG ATA AGT GCA TGT 
Asp Arg He Ser Ala Cys 



1620 
1680 
1740 
1800 
1860 
1920 
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123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 
ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GOG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TCG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC.CTC CTG GGA 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAG CCC GCA GGG AGA 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAG GTG TGC CCT GGG CAT ACC GTA AAG TOG GGG TGT 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His . Thr Val Lys Ser Gly Cys 



60 



120 



180 



240 



300 



360 



GGG GGA CCA GGA GAG GGC TAG TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAC 420 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TCG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC CCC TGG GAC 480 
Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAG GAC CTC TCC AAA GTA TCC AAT TCC 540 
Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCA 600 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val lieu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG ACA GGA 660 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTC 720 
Thr Asp Pro He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 780 
Pro He Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC AGT 840 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 

ACA CCC CCC CCA GGT ACC GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT CAG GCG 900 
Thr Pro Pro Pro Gly Thr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala 

CTT AAC CTC ACC AAT CCC GAC AAG ACC CAA GAA TGT TGG CTG TGC TTA GTG TCG GGA CCT 960 
Leu Asn Leu Thr Asn Pro Asp Lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro 

CCT TAT TAC. GAA GGA GTA GCG GTC GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT CCG GCC 1020 
Pro Tyr Tyr Glu Gly Val Ala Val Val Gly Thr Tyr Thr Asn His Ser Thr Ala Pro Ala 

AAC TGT ACG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA 1080 
Asn Cys Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu 

TGC ATG GGG GCA GTA CCT AAA ACT CAC CAG GCC TTA TGT AAC ACC ACC CAA AGC GCC GGC 1140 
Cys Met Gly .Ala Val Pro Lys Thr His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly 



TCA GGA TCC TAC TAC CTT GCA GCA CCC GCC GGA ACA" ATG TGG GCT TGC AGC ACT GGA TTG 
Ser Gly Ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu 
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ACT CCC TGC TTG TCC ACC ACG GTG CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA 1260 
Thx Pro Cys Leu Ser Thr Thr Val Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu 

CTC TGG CCC AGA GTA ATT TAC CAC TCC CCC GAT TAT ATG TAT GGT CAG CTt'gAA CAG CGT 1320 
Leu Trp Pro Arg Val lie Tyr His Ser Pro Asp Tyr Met Tyr Gly Gin Leu Glu Gin Arg 

ACC AAA TAT AAA AGA GAG CCA GTA TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC 1380 
Thx Lys Tyr Lys Arg Glu Pro Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr 



1440 



1500 



1560 



ATG GGA GGG ATT GCA GCT GGA ATA GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC CAG CAG 
Met Gly Gly lie Ala Ala Gly lie Gly Thr Gly Thr Thr Ala Leu lie Lys Thr Gin Gin 

TTT GAG CAG CTT CAT GCC GCT ATC CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC 
Phe Glu Gin Leu His Ala Ala lie Gin Thr Asp Leu Asn Glu Val Glu Lys Ser lie Thr 

AAC CTA GAA AAG TCA CTG ACC TOG TTG TCT GAA GTA GTC CTA CAG AAC CGC AGA GGC CTA 
Asn Leu Glu Lys Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly Leu 

GAT TTG CTA TTC CTA AAG GAG GGA GGT CTC TGC GCA GCC CTA AAA GAA GAA TGT TGT TTT 1620 
Asp Leu Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe 

^^^^^^^^^^AGCATOGCCAAATTAA^GAA AGG CTT AAT 1680 
Tyr Ala Asp His Thr Gly Leu Val Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn 

CTA TTT GAG ACA GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC 1740 
Gin Arg Gin Lys Leu Phe Glu Thr Gly Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser 

Pt-^ ntK^" nfc^ TTA ATC TCC ACC ATC ATG GGA OCT CTA ATA GTA CTC TTA CTG ATC 1800 

Pro Trp Phe Thr Thr Leu He Ser Thr He Met Gly Pro Leu He Val Leu Leu Leu He 

TTA CTC TTT GGA OCT TGC ATT CTC AAT OGA TTG. GTC CAA TTT GTT AAA GAC AGG ATC TCA 1860 
Leu Leu Phe Gly Pro Cys He Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser 

GTG GTC CAG GCT CTG 

Val Val Gin Ala Leu 1875 
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123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890' 
ATG GCG CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA60 

Het Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

! 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAT CAG GTC TTT AAT20 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro His Gin Val Phe Asn 

GTA ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG 80 
Val Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu 

GGA ACT GTA CAA GAT GCC TTC CCA f^AA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG 40 
Gly Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu 



GAG TGG GAC CCT TCA GAC CAG GAA fXG TAT GTC GGG TAT 


GGC 


TGC 


AAG 


TAC 


CCC 


GCA 


GGG00 


Glu Trp Asp Pro Ser Asp Gin Glu pro Tyr Val Gly Tyr 

1 


Gly 


Cys 


Lya 


Tyr Pro 


Ala 


Gly 


AGA CAG CGG ACC CGG ACT TTT GAC ITT TAC GTG TGC CCT 


GGG 


CAT 


ACC 


GTA AAG 


TCG 


GGG 60 


Arg Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro 


Gly 


His 


Thr 


Val 


Lys 


Ser 


Gly 


TOT GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG 


TGT 


GAA 


ACC 


ACC 


GGA 


CAG 


GC520 


Cys Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly 


Cys 


Glu 


Thr 


Thr 


Gly 


Gin 


Ala 


TAC TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT 


AAG 


CGC 


GGT 


AAC 


ACC 


CCC 


TQ380 


Tyr Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu 


Lys 


Arg 


Gly 


Asn 


Thr 


Pro 


Trp 


GAC ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC 


GAC 


CTC 


TCC 


AAA 


GTA 


TCC 


AAT40 


Asp Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr 


Asp 


Leu 


Ser 


Lys 


Val 


Ser 


Asn 


1 

TCC TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT 


CTA 


GTC 


CTA 


GAA 


TTC 


ACT 


GAB 00 


Ser Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro 


Leu 


Val 


Leu 


Glu 


Phe 


Thr 


Asp 



GGA ACA GAT CCT ATT ACC ATG TTC 
Gly Thr Asp Pro He Thr Met Phe 

GTC CCC ATA GGG CCC AAC CCA GTA 
Val Pro He Gly Pro Asn Pro Val 



ACA CAT CAG GCC CTA TGT AAT ACC 
Thr His Gin Ala Leu Cys Asn Thr 



GGA 
Gly 


CTG 
Leu 


AGA 
Arg 


CTG 
Leu 


TAC 
Tyr 


CGG 


AOS 60 
Thr 


GTC 
Val 


CTT 
Leu 


AAT 
Asn 


GTG 
Val 


GGA 

Gly 


CCC 

pro 


CG&20 
Arg 


CTC 
Leu 


CCT 
Pro 


TCC 
Ser 


TCA 
Ser 


CCA 
Pro 


ATA 
lie 


GAG 80 
Glu 


AGT 
Ser 


TAC 
Tyr 


CCC 
Pro 


CCT 
Pro 


TCC 
Ser 


ACT 
Thr 


ACQ 40 
Thr 


CAG 
Gin 


CCA 
Pro 


CCC 
Pro 


CCA 
Pro 


GGT 
Gly 


ACC 
Thr 


GAJS00 
GlU 


CTC 
Leu 


AAC 
Asn 


CTC 
Leu 


ACC 
Thr 


AGT 
Ser 


CCT 
Pro 


GAG 60 
Asp 


CCC 
Pro 


TAC 
Tyr 


TAC 
Tyr 


GAA 
Glu 


GGG 
Gly 


GTT 
Val 


GOJD20 
Ala 


AAC 
Asn 


TGC 

Cys 


TCC 
Ser 


GTG 
Val 


GCC 
Ala 


TCC 
Ser 


CA2D80 
Gin 


TGC 
Cys 


ATA 
He 


GGA 
Gly 


GCA 
Ala 


GTT 
Val 


CCC 
Pro 


AAA40 

Lys 


CGA 
Arg 


GGG 
Gly 


TCC 
Ser 


TAT 
Tyr 


TAT 
Tyr 


CTA 
Leu 


GXE00 
Val 
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123 456 739 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 
GCC CCT AC A GGT ACC ATG TGG GCT TGT AGT ACC GGG CTT ACT CCA TGC ATC TCC ACC Ada60 
Ala Pro Thr Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys He Ser Thr Thr 

ATA CTG AAC CTT ACC ACT GAT TAT TGT GTT CTT GTC GAA CTC TGG CCA AGA GTC ACC TA320 
He Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val Thr Tyr 

CAT TCC CCC AGC TAT GTT TAG GGC CTG TTT GAG AGA TCC AAC CGA CAC AAA AGA GAA COG80 
His Ser Pro Ser Tyr Val . Tyr Gly Leu Phe Glu Arg Ser Asn Arg His Lys Arg Glu Pro 

GTG TCG TTA ACC CTG GCC CTA TTA TTG GGT GGA CTA ACC ATG GGA GGG ATT GCA GCT GGA 40 - 
Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly He Ala Ala Gly 

ATA GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC CAG CAG TTT GAG CAG COT CAT GCC GCTBOO 
lie Gly Thr Gly Thr Thr Ala Leu He Lys Thr Gin Gin Phe Glu Glh Leu His Ala Ala 

ATC CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC AAC CTA GAA AAG TCA CTG AOO60 
lie Gin Thr Asp Leu Asn Glu Val Glu Lys Ser He Thr Asn Leu Glu Lys Ser Leu Thr 

TCG TTG TCT GAA GTA GTC CTA CAG £AC CGC AGA GGC CTA GAT TTG CTA TTC CTA AAG G2UB2 0 
Ser Leu Ser Glu Val Val Leu Gin Ksn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu 

GGA GGT CTC TGC GCA GCC CTA AAA J3AA GAA TGT TGT OTT TAT GCA GAC CAC ACG GGG C12S80 
Gly Gly Leu Cys Ala Ala Leu Lys plu Glu Cys Cys Phe Tyr Ala Asp His Thr Gly Leu 

GTG AGA GAC AGC ATG GCC AAA TTA AGA GAA AGG COT AAT CAG AGA CAA AAA GTA TTT GAG 4 0 
Val Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu 

ACA GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC CCC TGG TTT ACC ACC TTA AT0OO 
Thr Gly Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He 

TCC ACC ATC ATG GGA CCT CTA ATA GTA CTC TTA CTG ATC TTA CTC TTT GGA CCT TGC ATO60 
Ser Thr He Met Gly Pro Leu He Val Leu Leu Leu He Leu Leu Phe Gly Pro Cys He 

CTC AAT CGA TTA GTT CAA TTT GOT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTO20 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu 

ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAG GAG CCA 1952 
Thr Gin Gin. Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATG GCG CGT TCA ACG CTC TCA AAA pCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA60 
Met Ala Arg Ser Thr Leu Ser Lys pro Pro Gin Asp Lys He Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA2 0 
He Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

i" 

ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA 80 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG40 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG AGS 00 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TCG GGG TGT 60 
Gin Arg Thr Arg Thr Phe Asp Phe .Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TA<32 0 
Gly Gly Pro Gly Glu Gly Tyr Cys ply Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC fcTA ATC TCC CTT AAG CGC GGT AAC ACC CCC TGG GAG 80 
Trp Lys Pro Thr Ser Ser Trp Asp j&eu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA GOT GCC TGT j3GC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAT TO340 
Thr Gly Cys Ser Lys Val Ala Cys bly Pro Cys Tyr- Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCJS00 
Phe Gin Gly Ala Thr . Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA C*G AGA CTG TAC CGG ACA GGA 60 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTCT2 0 
Thr Asp Pro He Thr Met. Phe ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT80 
Pro He Gly Pro Asn Pro Val Leu tro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC AG«S4 0 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 

ACA CCC TCA ACC TCC CCT ACA AGT CCA AGT GTC CCA CAG CCA CCC CCA GGT ACC GAA AAT 00 
Thr Pro ser Thr Ser Pro Thr Ser (Pro ser Val Pro Gin Pro Pro Pro Gly Thr Glu Asn 

AGG CTG CTA AAC TTA GTA GAC GGA fecc TAC CAA GCC CTC AAC CTC ACC AGT CCT GAC AAS60 
Arg Leu Leu Asn Leu Val Asp Gly £J.a Tyr Gin Ala Leu Asn Leu Thr Ser Pro Asp Lys 

ACC CAA GAG TGC TGG TTG TGT CTA JSTA ' GCG GGA CCC CCC TAC TAC GAA GGG GTT GCC G3BD20 
Thr Gin Glu Cys Trp Leu Cys Leu fcal Ala Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 

CTG GGT ACC TAC TCC AAC CAT ACC TCT GCT CCA GCC AAC TGC TCC GTG GCC TCC CAA CA08O 
Leu Gly Thr Tyr Ser Asn His Thr Ser Ala Pro Ala Asn Cys Ser Val Ala ser Gin His 

AAG TTG ACC CTG TCC GAA GTG ACC GGA CAG GGA CTC TGC AT?A GGA GCA GTT CCC AAA AflA4 0 
Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu Cys He Gly Ala Val Pro Lys Thr 

CAT CAG GCC CTA TGT AAT ACC ACC CAG ACA AGC AGT CGA GGG TCC TAT TAT CTA GTT GOaOO 
His Gin Ala Leu Cys Asn Thr Thr Gin Thr Ser Ser Arg Gly Ser Tyr Tyr Leu Val Ala 
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CCT AC A GGT ACC ATG TGG GCT TGT AGT ACC GGG CTT ACT CCA TGC ATC TCC ACC ACC A1260 

Pro Thr Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys He Ser Thr Thr He 

CTG AAC CTT ACC ACT GAT TAT TGT j3TT CTT GTC GAA CTC TGG CCA AGA GTC ACC TAT CAE 2 0 
Leu Asn Leu Thr Thr Asp Tyr Cys yal Leu Val Glu Leu Trp Pro Arg Val Thr Tyr His 

TCC CCC AGC TAT GTT TAC GGC CTG JTTT GAG AGA TCC AAC CGA CAC AAA AGA GAA CCG GTG80 
Ser Pro Ser Tyr Val Tyr Gly Leu phe Glu Arg Ser Asn Arg His Lys Arg Glu Pro Val 

TCG TTA ACC CTG GCC CTA TTA TTG fcSGT GGA CTA ACC ATG GGA GGG ATT GCA GCT GGA AM40 
Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly lie Ala Ala Gly lie 

GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC CAG CAG TTT GAG CAG CTT CAT GCC GCT ATS 00 
Gly Thr Gly Thr Thr Ala Leu He Lys Thr Gin Gin Phe Glu Gin Leu His Ala Ala lie 

CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC AAC CTA GAA AAG TCA CTG ACC TCK360 
Gin Thr Asp Leu Asn Glu Val Glu Lys Ser lie Thr Asn Leu Glu Lys Ser Leu Thr Ser 

TTG TCT GAA GTA GTC CTA. CAG AAC CGC AGA GGC CTA GAT TTG CTA TTC CTA AAG GAG GO3S20 
Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly 

GGT CTC TGC GCA GCC CTA AAA GAA GAA TGT TGT TTT TAT GCA GAC CAC ACG GGG CTA CM 80 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TTA AGA GAA AGG CTT AAT CAG AGA CAA AAA CTA TTT GAG AOS40 
Arg Asp Ser Met. Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Thr 

GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC CCC TGG TTT ACC ACC TTA ATC TCKS00 
Gly Gin Gly Trp Phe Glu Gly Leu phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He Ser 

ACC ATC ATG GGA CCT CTA ATA GTA CTC TTA CTG ATC TTA CTC TTT GGA CCT TGC ATT COB260 
Thr He Met Gly Pro Leu He Val [Leu Leu Leu He Leu Leu Phe Gly Pro Cys lie Leu 

AAT CGA TTA GTT CAA TTT GTT AAA f3AO AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG AOS20 
Asn Arg Leu Val Gin Phe Val Lys £sp Arg He Ser Val Val flin Ala Leu Val Leu Thr 

CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 1959 
Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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1 23 456 789 012 345 678 901 234 S67 890 123 456 789 012 345 678 901 234 567 890 ' 
ATG GCG CGT TCA ACG CTC TCA AAA pCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asa Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA .GTA GGG ATG GCA GAG AGC CCC CAT CAG GTC TTT AAT20- 
Xle Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro His Gin Val Phe Asn 

GTA ACC TGG AGA GTC ACC AAC CTG kTG ACT GGG CGT ACC GCC AAT GCC ACC TCG CTC CTG 80 
Val Thr Txp Arg Val Thr Asn Leu jfet Thr Gly Arg Thr Ala Asa Ala. Thr &er/ Leu . Leu . 1 

. <* '• " ... , ■ f ; 

GGA ACT GTA CAA GAT GCC TTC CCA fevAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG 40 
Gly Thr Val Gin Asp Ala Phe Pro pys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu 

GAG TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG00 
Glu Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly 

AGA CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TCG GGG60 
Arg Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly 

TGT GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA. TGQ. .<&Q TGT GAA ACC ACC GGA CAG GC120 
cys Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala 

TAC TGG AAG CCC ACA TCA, TCG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC CCC TG380 
Tyr Trp Lys Pro Thr Ser Ser Trp Asp Leu lie Ser Leu £ys Arg Gly Asn Thr Pro Tro 

GAC ACG GGA TGC TCT AAA GTT GCC }rGT GGC CCC TGC TAC GAC CTC TCC AAA GTA 'TCC AA'540 
Asp Thr Gly cys Ser Lys Val Ala cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn 

TCC TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GA'BOO 
Ser pk e Gin Gly Ala Thr Arg Gly 'Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp 

GCA GGA AAA AAG GCT AAC TGG GAC bcG CCC AAA TCG TGG GGA CTG AGA CTG %AC CGG ftCJggO 
Ala Gly Lys Lys Ala Asn Trp Asp ply Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr' 

GGA ACA GAT CCT ATT ACC ATG TTC [TCC CTG ACC CGG CAG GTC CTT ££T GTG GGA CCC C$220 
Gly Thr Asp Pro lie Thr Met Phe per Leu Thr Arg Gin Val Leu isn Val Gly Pro £rg 1 

GTC CCC ATA GGG CCC AAC CCA GTA jlTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG 80 
Val Pro lie Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro lie G^u 

ATT GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC AGT TAC CC£ CCT ygCJ £CT £Cg$p 
lie Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro SerTflar Thr 

AGT ACA CCC TCA ACC TCC CCT ACA AGT CCA AGT GTC CCA CAG CCA CCJC CCA GGA ACT GG$00 
Ser Thr Pro Ser Thr Ser Pro Thr Ser Pro Ser Val Pro Gin Pro Pro Pro Gly Thr Gly 

GAT AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT CAG GCG CTT AAC CTC ACC AAT CCC GA$60 
Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp 

AAG ACC CAA GAA TGT TGG CTG TGC JTTA GTG TCG GGA CCT CCT TAT TAC GAA GGA GTA GCLCB20* 
Lys Thr Gin Glu Cys Trp Leu Cys &eu Val Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala 

GTC GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT CCG GCC AAC TGT ACG GCC ACT TCC Ca»80 
Val Val Gly Thr Tyr Thr Asn His £er Thr Ala Pro. Ala Asn Cys Thr Ala TJir Ser Gin 

CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA TGC ATG GGG GCA GTA CCT AA^.40 
His Lys Leu Thr Leu Ser Glu Val frhr Gly Gin Gly Leu Cys Met Gly Ala Val Pro Lys 

ACT CAC CAG GCC TTA TGT AAC ACC kCC CAA. AGC GCC GGC TCA GGA TCC TAC TAC QTT GCL200 
Thr His Gin Ala Leu Cys Asn Thr fPhr Gin Ser Ala Gly Ser Gly Ser Tyr Tyr Leu Ala 
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GCA CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTG ACT CCC TGC 1TG TCC ACC ACLG60 
Ala Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Tlir Pro Cys Leu Ser Thr Thr 

GTG CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA CTC TGG ' CCC AGA GTA ACT TAO20 
Val Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val lie Tyr 

CAC TCC CCC GAT TAT ATG TAT GGT CAG CCT GAA CAG CGT ACC AAA TAT AAA AGA GAG COflSO 
His Ser Pro Asp Tyr Met Tyr Gly Glu Leu Glu Gin Arg Thr Lys Tyr Lys Arg \©lu Pro 

GTA TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC ATG GGG GGA ATT GCC GCT GGA 40 
Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly He Ala Ala Gly 

ATA GGA ACA GGG ACT ACT GCT CTA ATG GCC ACT CAG CAA TTC CAG CAG. CTC CAA GCC GO500 
He Gly Thr Gly Thr Thr Ala Leu Met Ala Thr Gin Gin Phe Gin Gin Leu Gin Ala Ala 

GTA CAG GAT GAT CTC AGG GAG GTT pAA AAA TCA ATC TCT AAC CTA GAA AAG~TCT?CTC AOT60 
Val Gin Asp Asp Leu Arg Glu Val jslu Lys Ser He Ser Asn Leu Glu Lys Ser Leu Thr 



TCC CTG TCT GAA GTT GTC CTA CAG 
Ser Leu Ser Glu Val Val Leu Gin 



&AT C<£A AGG GGC <JTA GAC CTG CTA TTT CTA A Efr G&J620 
Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu 

GGA GGG CTG TGT GCT GCT CTA AAA GAA GAA TGT TGC TTC TAT GCG GAC CAC ACA GGA CIS 80 
Gly Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe.iyr Ala Asp pi^ Thr Gly Leu - 

GTG AGA GAC AGC ATG GCC AAA CTG AGA GAG AGG CTT AAT CAG AGA • CAG "AAA CTC '*FET * GAG4 0 *' 
Val Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin £r ? Gin Lys Leu ?he GJjel ' ' 

TCA ACT CAA GGA TGG TTT GAG GGA CTG TTT AAC AGA TCC CCT TGG TCT ACC ACC TTG A1S00 
Ser Thr Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He 

TCT ACC ATT ATG GGA CCC CTC ACT GTA CTC CTA ATG ATT CTG CTC CTC GGA CCC TGC AXB60 
Ser Thr He Met Gly Pro Leu He Val Leu Leu Met He Leu Leu Phe Gly Pro Cys He 

CCT AAT CGA TTA GCT CAA CTT GCT £AA GAC AGG ATC TCA GTA GTC CAG GCT CTA GTC CTO20 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu * 

ACT CAA CAA TAG CAC CAG CTA AAA .CCA CTA GAA TAG GAG CCA 1962 
Thar Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATG GCG CGT TCA ACG CTC TCA AAA ZCP CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA60 
Met Ala Arg Ser Thr Leu Ser Lys (pro Pro Gin Asp Lys He Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA20 
He Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA80 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 40 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG AGS00 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TCG GGG TG360 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Prb Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TA32 0 
Gly Gly Pro Gly Glu Gly Tyr Cys fely Lys Trp Gly cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC pTA ATC TCC CTT AAG GGC GGT AAC ACC CCC TGG GAS 80 
Trp Lys Pro Thr Ser Ser Trp Asp (Leu He Ser Leu Lys Arg Gly Asn Thr Pro~Trp Asp 

ACG GGA TGC TCT AAA GTT GCC TGT )3GC CCC TGC TAC GAC CTC TCC rAAA CTA TCC : AWTCO40 - 
Thr Gly Cys Ser Lys Val Ala Cys f3ly Pro cys Tyr Asp Leu Ser £ys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GC*00 
Phe Gin Gly Ala Thr Arg Gly Gly Krg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala^ 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG ACA GGJ560 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg £eu £yr ^rcj Thr Gty 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTQ?0 
Thr Asp Pro He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Vai ' 

CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 80 
Pro He Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

GTA CGG GCT CCA CAG CCA CCT AGC bcc CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC AOT40 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr. Thr Ser 

ACA CCC TCA ACC TCC CCT ACA AGT CCA AGT GTC CCA CAG CCA CCC CCA GGA ACT GGA GAT00 
Thr Pro Ser Thr Ser Pro Thr Ser pro Ser Val Pro Gin Pro Pro Pro Gly Thr Gly Asp 

AGA CTA CTA GCT CTA GTC AAA GGA Jj3CC TAT CAG GCG CTT AAC CTC ACC AAT CCC GAC AAG 60 



Arg Leu Leu Ala Leu Val Lys Gly jala Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys 

1 • i . A,' \ * ; ; ?,f fc£p 

ACC CAA GAA TGT TGG CTG TGC TTA bTG TCG GGA CCT CCT TAT TAC GAA GGA OTA GCG G10>< 
Thr Gin Glu Cys Trp Leu Cys Leu J/al Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala vit 



GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT CCG GCC AAC TGT ACG GCC ACT TCC CAA C2L0580 
Val Gly Thr Tyr Thr Asn His Ser Thr Ala Pro Ala' Asn Cys Thr Ala Thr Ser Gin His 

AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA TGC ATG GGG GCA GTA CCT AAA AOS 40 
Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu Cys Met Gly Ala Val Pro Lys Thr^ 

CAC CAG GCC TTA TGT AAC ACC ACC CAA AGC GCC GGC TCA GGA TCC TAC TAC CTT GCA GO200 
His Gin Ala Leu Cys Asn Thr Thr Gin Ser . Ala Gly Ser Gjy Ser Syr ?yr Leu Ala £U 
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CCC GCC GGA AC A ATG TGG GCT TGC AGC ACT GGA TTG ACT CCC TGC TTG TCC ACC ACG GM60 
Pro Ala Gly Thr Met Trp Ala Cys .Ser Thr Gly Leu Thr Pro Cys Leu Ser Thr Thr Val 

CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA CTC TGG CCC AGA GTA ATT TAC C4O20 
Leu Asn Leu Thr Thr Asp Tyr Cys j/al Leu Val Glu Leu Trp Pro Arg Val lie Tyr His 

TCC CCC GAT TAT ATG TAT GGT CAG tTT GAA CAG CGT ACC AAA TAT AAA ASA- SftS^CA G^SO 
Ser Pro Asp Tyr Met Tyr Gly Gin jceu Glu Gin Arg Thr Lys Tyr Lys Arg Glu.^ro Val ' 

TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC ATG GGG GGA ATO* GCC :GCT iGGA Aq»40 
Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly lla Ala Ala -Gly He 

GGA ACA GGG ACT ACT GCT CTA ATG GCC ACT CAG CAA TTC CAG CAG CTC CAA GCC GCA GUSOO 
Gly Thr Gly Thr Thr Ala Leu Met Ala Thr Gin Gin Phe Gin Gin Leu Gin Ala Ala Val 

CAG GAT GAT CTC AGG GAG GTT GAA AAA TCA ATC TCT AAC C*A GAA AAG TCT CTC ACT TCB560 
Gin Asp Asp Leu Arg Glu Val Glu Lys Ser He Ser Asn Leu Glu Lys Ser Leu Thr Sex 

CTG TCT GAA GTT GTC CTA CAG AAT CGA AGG GGC CTA OSfi-STG TTA TTT CTA AAA GAA GO2520 
Leu Ser Glu Val Val Leu. Gin Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly 

GGG CTG TGT GCT GCT CTA AAA GAA GAA TGT TGC TTC TAT GCG GAC CAC ACA GGA CTA GKS80 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TTG AGA GAG AGG CTT AAT CAG AGA CAG AAA CTG TTT GAG TOS4 0 
Ajrg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Ser 

ACT CAA GGA TGG TTT GAG GGA CTG TTT AAC AGA TCC CCT TGG TTT ACC ACC TOG ATA TCTBO0 
Thr Gin Gly Trp Phe Glu Gly Leu f?he Asn Arg Ser Pro Trp Phe Thr Thr Lfu He Ser"'' 



ACC ATT ATG GGA CCC CTC ATT GTA CTC CTA ATG ATT TTG CTC TTC GGA CCC TGC ATT CTB60 
Thr He Met Gly Pro Leu He Val fceu Leu Met He Leu Leu Phe Gly Pro Cys He Leu 

AAT CGA TTA GTT CAA TTT GTT AAA pAC AGG ATC TCA GTA GTC CAG G$T TTA GTC CTG KQ32Q 
Asxx Arg Leu Val Gin Phe Val Lys j\sp Arg He Ser Val Val Gin Ala Leu V$l Leu Thr^ 

CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA I959 
Glu Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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Met Ala Arg Ser Thr Leu Ser 



Lys jpi 



GTA ACC TGG AGA GTC ACC AAC CTG 
Val Thr Trp Arg Val Thr Asn Leu 

GGA ACT GTA CAA GAT GCC TTC CCA 
Gly Thr Val Gin Asp Ala Phe Pro 



pet 



GGA ACA GAT CCT ATT ACC ATG TTC trCC 

per 



Gly Thr Asp Pro He Thr Met Phe 

GTC CCC ATA GGG CCC AAC CCA GTA 
Val Pro lie Gly Pro Asn Pro' Val 



trTA 
peu 



ACA CAT CAG GCC CTA TGT AAT ACC 
Thr His Gin Ala Leu Cys Asn Thr 



GAT 
Asp 


AAG 
Lys 


AIT 
lie 


AAC CCG TGG AAG 
Asn Pro Trp Lys 


CCC 
Pro 


TTA60 
Leu 


GCA 
Ala 


GAG 
Glu 


AGC 
Ser 


CCC CAT 
Pro His 


CAG GTC 
Gin Val 


TTT 
Phe 


AAT2 0 
Asn 


CGT 
Arg 


ACC 
Thr 


GCC 
Ala 


AAT GCC 
Asn Ala 


ACC TCC 
Thr Ser 


CTC 
Leu 


CTG 80 
Leu 


TTT 
Phe 


GAT 
Asp 


CTA 
Leu 


TGT GAT 
Cys Asp 


CTG GTC 
Leu Val 


GGA 
Gly 


GAG40 
Glu 


GGG 
Gly 


TAT 
Tyr 


GGC 
Gly 


TGC AAG 
Cys Lys 


TAC CCC 
Tyr Pro 


GCA 
Ala 


GGG00 
Gly 


TGC 
Cvs 


CCT 
Pro 


GGG 
Glv 


CAT ACC 
His Thr 


GTA AAG 
Val Lys 


TCG 
Ser 


GGG60 
Glv 


TGG 
Trp 


GGG 
Gly 


TGT 
Cys 


GAA ACC 
Glu Thr 


ACC GGA 
Thr Gly 


CAG 
Gin 


GCS20 
Ala 


TCC 
Ser 


CTT 
Leu 


AAG 

LVB 


CCC GGT 
Arg Gly 


AAC ACC 
Asn Thr 


CCC 
Pro 


TG380 


TGC 
Cvs 


TAC 
Tvr 


GAC 
Asp 


CTC TCC 
Leu Ser 


AAA GTA 
Lys Val 


TCC 
Ser 


AA*S40 
Asn 


AAC 
Asn 


CCT 
Pro 


CTA 
Leu 


GTC CTA 
Val Leu 


GAA TTC 
Glu Phe 


ACT 
Thr 


GABO0 
Asp 


TCG 
Ser 


TGG 
Trp 


GGA 
Gly 


CTG AGA 
Leu Arg 


CTG TAC 
Leu Tyr 


CGG 
Arg 


ACS 60 
Thr 


CGG 
Arg 


CAG 
Gin 


GTC 
Val 


CTO 'AAT 
Leu Asn 


GTG GGA 
Val Gly 


CCC 
Pro 


CGS20 
Arg 


CAA 
Gin 


AGA 
Arg 


CTC 
Leu 


CCT TCC 
Pro Ser 


TCA CCA 
Ser Pro 


ATA 
He 


GAG 80 
Glu 


aat' 

Asn 


ACC 
Thr 


AGT 
Ser 


TAC CCC 
Tyr Pro 


CCT TCC 
Pro Ser 


ACT 
Thr 


AO340 
Thr 


GTC 
Val 


CCA 
Pro 


CAG 
Gin 


CCA CCC 

Pro Pro 


CCA GGT 
Pro Gly 


ACC 
Thr 


GASO0 
Glu 


CAA 
Gin 


GCC 
Ala 


CTC 
Leu 


AAC CTC 
Asn Leu 


ACC AGT 
Thr Ser 


CCT 
Pro 


GA960 
Asp 


GGA 
Gly 


CCC 
Pro 


CCC 
Pro 


TAC TAC 
Tyr Tyr 


GAA GGG 
Glu Gly 


GTT 
Val 


GOQD20 
Ala 


CCA 
Pro 


GCC 
Ala 


AAC 
Asn 


TGC TCC 
Cys Ser 


GTG GCC 
Val Ala 


TCC 
Ser 


CAA) 80 
Gin 


GGA. 
Gly 


CTC 
Leu 


TGC 
Cys 


ATA GGA 
He Gly 


GCA GTT 
Ala Val 


CCC 

Pro 


AAA 4 0 
Lys 


AGC 
Ser 


AGT 
Ser 


CGA 
Arg 


GGG TCC 
Gly Ser 


TAT TAT 
Tyr Tyr 


CTA 
Leu 


GOT 00 

val 
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GCC CCT ACA GGT ACC ATG TGG GCT TGT AGT ACC GGG CTT ACT CCA TGC ATC TCC ACC AdfleO 

Ala Pro Thr Gly Thr Wet Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys lie Ser Thr Thr 

ATA CTG AAC CTT ACC ACT GAT TAT TGT GOT CTT GTC GAA CTC TGG CCA AGA GTC ACC TAT 20 
lie Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val Thr Tyr 

CAT TCC CCC AGC TAT GTT TAG GGC CTG TTT GAG AGA TCC AAC CGA CAC AAA AGA GAA COG80 
His Ser Pro Ser Tyr Val Tyr Gly Leu Phe Glu Arg Ser Asn Arg His Lys Arg Glu Pro 

GTG TCG TTA ACC CTG GCC CTA TTA TTG GGT GGA CTA ACC ATG GGG GGA ATT GCC GCT GOA40 
Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly lie Ala Ala Gly 

ATA GGA ACA GGG ACT ACT GCT CTA ATG GCC ACT CAG CAA TTC CAG CAG CTC CAA GCC GO500 
lie Gly Thr Gly Thr Thr Ala Leu flet Ala Thr Gin Gin Phe Gin Gin Leu Gin Ala Ala 

GTA CAG GAT GAT CTC AGG GAG GTT |3AA AAA TCA ATC TCT AAC CTA GAA AAG TCT CTC AOS 60 
Val Gin Asp Asp Leu Arg Glu Val |3lu Lys Ser He Ser Asn Leu Glu Lys Ser Leu Thr 

TCC CTG TCT GAA GTT GTC CTA CAG Kat CGA AGG GGC CTA GAC TTG TTA TTT CTA AAA GAS20 
ser Leu Ser Glu Val Val Leu Gin |&sn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu 

GGA GGG CTG TGT GCT GCT CTA AAA GAA GAA TGT TGC TTC TAT GCG GAC CAC ACA GGA CIS 80 
Gly Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr Zly Leu 

GTG AGA GAC AGC ATG GCC AAA TTG AGA GAG AGG CTT AAT CAG AGA CAG AAA CTG TTT G2L340 
Val Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu 

TCA ACT CAA GGA TGG TTT GAG GGA CTG TTT AAC AGA TCC CCT TGG TTT ACC ACC TTG AT300 
ser Thr Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu Zle 

TCT ACC ATT ATG GGA CCC CTC ATT GTA CTC CTA ATG ATT TTG CTC TTC GGA CCC TGC ATB60 
Ser Thr He Met Gly Pro Leu He Val Leu Leu Met He Leu Leu Phe Gly Pro Cys He 

CTT AAT CGA TTA GTT CAA TTT GTT *AA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTOI 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu 

ACT CAA CAA TAC CAC CAG CTA AAA -GCA CTA GAA TAC GAG CCA 1962 
Thr/ Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATG GCG CGT TCA ACQ CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA60 
Met Ala Arg Ser Thr Leu Ser Lys pro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA20 
He Val Met Gly Val Leu Leu Gly pal Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TGG AGA GTC ACC AAC CTG ATG jftCT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA 80 
Thr Trp Arg Val Thr Asn Leu Met (rhr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA JlTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG40 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG AGSO0 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TCG GGG TGT60 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly cys 

GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAS2 0 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CCC GGT AAC ACC CCC TGG GAS 80 
Trp Lys Pro Thr Ser Ser Trp Asp jLeu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAT TCS40 
Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GOS00 
Phe Gin Gly Ala Thr Arg Gly Gly cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG bcc AAA TCG TGG GGA CTG AGA CTG TAC CGG ACA GGJS60 
Gly Lys Lys Ala Asn Trp Asp Gly pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG TTC TCC bTG ACC CGG CAG GTC CTT AAT GTG GGA CCC CGA GTC 20 

Thr Asp Pro He Thr Met Phe Ser ^eii Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 80 
Pro He Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC ACT TAC CCC CCT TCC ACT ACC AG'S 4 0 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser TyV Ppo Pro Ser Thr. Thr Ser 

ACA CCC TCA ACC TCC CCT ACA ACT CCA ACT GTC CCA CAG CCA CCC CCA GGT ACC GAA AA300 
Thr Pro ser Thr Ser Pro Thr Ser Pro Ser Val Pro Gin Pro Pro Pro Gly Thr Glu Asn 

AGG CTG CTA AAC TTA GTA GAC GGA GCC TAC CAA GCC CTC AAC CTC ACC AGT CCT GAC AAA60 
Arg Leu Leu Asn Leu Val Asp Gly Ala Tyr Gin Ala Leu Asn Leu Thr Ser Pro Asp Lys ' ■ 

ACC CAA GAG TGC TGG TTG TGT CTA GTA GCG GGA CCC CCC TAC TAC GAA GGG GTT GCC GTB020 
Thr Gin Glu Cys Trp Leu Cys Leu Val Ala Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 

CTG GCT ACC TAC TCC AAC CAT ACC TCT GCT CCA GCC AAC TGC TCC GTG GCC TCC CAA CAO80 
Leu Gly Thr Tyr Ser Asn His Thr Ser Ala Pro Ala Asn cys Ser Val Ala Ser Gin His 

AAG TTG ACC CTG TCC GAA GTG ACC EGA CAG GGA CTC TGC ATA GGA GCA GTT CCC AAA A0A40 
Lys Leu Thr Leu Ser Glu Val Thr fely Gin Gly Leu Cys He Gly Ala Val Pro Lys Thr 

CAT CAG GCC CTA TGT AAT ACC ACC CAG ACA AGC AGT CGA GGG TCC TAT TAT CTA GTT GOG 00 
His Gin Ala Leu Cys Asn Thr Thr Jain Thr Ser Ser Arg Gly Ser Tyr Tyr Leu Val Ala 
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CCT ACA GGT ACC ATG TGG GCT TGT AGT ACC GGG CTT ACT CCA TGC ATP TCC ACC ACC &1&60 
Pro Thr. Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys lie Ser Thr Thr lie 

CTG AAC CTT ACC ACT GAT TAT TGT GTT CTT GTC GAA CTC TGG CCA AGA GTC ACC TAT CMOS 20 
Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val Thr Tyr His 

TCC CCC AGC TAT GTT TAC GGC CTG ITT GAG AGA TCC AAC CGA CAC AAA AGA GAA CCG G«2380 
Ser Pro Ser Tyr Val Tyr Gly Leu Phe Glu Arg Ser Aan Arg His Lys Arg Glu Pro Val 

TCG TTA ACC CTG GCC CTA TTA TTG GGT GGA CTA ACC ATG GGG GGA ATT GCC GCT GGA AM 40 
Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly He Ala Ala Gly He 

GGA ACA GGG ACT ACT GCT CTA ATG pCC ACT CAG CAA TTC CAG GAG CTC CAA GCC GCA CT500 
Gly Thr Gly Thr Thr Ala Leu Met £la Thr Gin Gin Phe Gin Gin Leu <31n Ala Ala Val 

CAG GAT GAT CTC AGG GAG GTT GAA &AA TCA ATC TCT AAC CTA GAA AAG TCT CTC ACT TOS60 
Gin Asp Asp Leu Arg Glu Val Glu |Lys Ser Xle Ser Asn Leu Glu Lys Ser Leu Thr Ser 

CTG TCT GAA GTT GTC CTA CAG AAT CGA AGG GGC CTA GAC TTG TTA TTT CTA AAA GAA GOS20 
Leu Ser Glu Val Val Leu Gin Asn Krg Arg Gly Leu . Asp Leu Leu Phe Leu Lys Glu Gly 

GGG CTG TGT GCT GCT CTA AAA GAA GAA TGT TGC TTC TAT GCG GAC CAC ACA GGA CTA GTS380 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TTG AGA GAG AGG CTT AAT CAG AGA CAG AAA CTG TTT GAG TCU240 
Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Ser 

ACT CAA GGA TGG TTT GAG GGA CTG TTT AAC AGA TCC CCT TGG TTT ACC ACC TTG ATA TOSOO 
Thr Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He Ser 

• . • • . ■• • ■ . i » 

ACC ATT ATG GGA CCC CTC ATT GTA CTC CTA ATG ATO TTG CTC TTC GGA CCC TGC ATT CTB60 
Thr He Met Gly Pro Leu He Val Leu Leu Met He Leu Leu Phe Gly Pro Cys He Leu 

AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG AOS 20 
Aim Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr 

CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 1959 
Gin Gin Tyr His Gin Leu Lys Pro ^eu Glu Tyr Glu Pro 



FIG. 28 (continued) 
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A/TG GCG CGT TCA ACG CTC .TCA AAA CCC CTT AAA AAT AAG GTT AAC CCG CGA GGC CCC CTA60 

Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC CCC TTA ATT CTT CTG ATG CTC AGA GGG GTC AGT ACT GCG GCC CAG CCG GCC ATG GCCL20 
lie Pro Leu lie Leu Leu Met. Leu Arg Gly Val Ser Thr Ala Ala Gin Pro Ala Met Ala 

CAG GTT CAG CTT CAG CAG TCT GGA GCT GAA CTG GTA AAG CCT GGG GCT TCA GTG AAG TTG80 
Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro Gly Ala Ser Val Lys Leu 

TCC TGC AAG GCT TCT GGC TAC ACC fcTC AC A AGC TAT GAT ATA AAC TGG GTG AGG CAG AGO 40 
Ser Cys Lys Ala Ser Gly Tyr Thr |Phe Thr ser Tyr Asp lie Asn Trp Val Arg Gin Arg 

CCT GAA CAG GGA CTT GAG TGG ATT felGA TGG ATT TTT CCT GGA GAT GGT AGT ACT AAG TAG 00 
Pro Glu G in Gly Leu Glu Trp lie ply Trp lie Phe Pro Gly Asp Gly Ser Thr Lys Tyr 

AAT GAG AAG TTC AAG GGC AAG GCC ^CA CTG ACT ACA GAC AAA TCC TCC AGC ACA GCC TAG 60 
Asn Glu Lys Phe Lys Gly Lys Ala JThr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr 

ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC- TAT TTC TGT GCA AGA ATG GCS20 
Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala lie Tyr Phe Cys Ala Arg Met Ala 

i" 

GAC TAT GGT CAC TAC GGC GCT GTG GAC TAC TGG GGT CAA GGG ACC ACG GTC ACC GTC TCQ80 
Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr 'Val Ser 

TCA GGT GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA TCG GAC ATC CAG ATS 40 
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp lie Gin |iet 

ACT CAG TCT CCA TCC TCC CTG TCT GCC TCT CTG GGA GAC AGA GTC ACC ATC AGT TGC AG<S00* 
Thr Gin Ser Pro Ser Ser Leu Ser ^Vla Ser Leu Gly Asp Arg Val Thr lie Ser Cys Arg 

GCA AGT CAG GAC ATT AGC AAT TAT TTA AAC TGG TAT CAG CAG ACA CCA GAT GGA ACT GTB60 
Ala Ser Gin Asp lie Ser Asn Tyr Xeu Asn Trp Tyr Gin Gin Thr Pro Asp Gly Thr Val 

AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA GTC CCA TCA AGG TTC AGT GGC22Q 
Lyg Leu Leu lie Tyr Tyr Thr Ser £xg Leu His Ser Gly Val Pro Ser Arg Phe Ser Gly 

AGT GGG TCT GGA ACA GAT TAT TCT fclTC ACC ATT AGC AAC CTG GAG CAA GAA GAT ATT GCCT80 
Ser Gly Ser Gly Thr Asp Tyr Ser fceu Thr lie Ser Asn Leu Glu G^n Glu Aisp lie Asia* 

ACT TAC TTT TGC CAA CAG GGT TAT kcG CTT CCG TGG ACG TTC GGT GGA GGC ACC AAG CT340 
Thr Tyr Phe Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Phe Gly Gly Gly Thr Lys Len 

GAG ATC AAA CGG GCG GCC GCA GGA ,CCC CGA GTC CCC ATA GGG CCC AAC CCA GTA TTA CCO00 
Glu lie Lys Arg Ala Ala Ala Gly Pro Arg Val Pro lie Gly Pro Asn Pro Val Leu Pro 

GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT GTA CCG GC?T CCA CAG CCA CCT AGC CCO60 
Asp Gin Arg Leu Pro Ser Ser Pro lie Glu lie Val Pro Ala Pro Gin Pro Pro Ser Pro 

CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC AGT ACA CCC TCA ACC TCC CCT ACA AGT CGLS20 
Leu Asn Thr Ser Tyr Pro, Pro Ser Thr Thr Ser Thr Pro Ser Thr Ser Pro T^xr Ser Pro* 

AGT GTC CCA CAG CCA CCC CCA GGA ACT GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GCLGD80 . * 
Ser Val Pro Gin Pro Pro Pro Gly jrhr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala 

TAT CAG GCG CTT AAC CTC ACC AAT CCC GAC AAG ACC CAA GAA TGT TGG CTG TGC TTA G^KMO 
Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys Thr Gin Glu Cys Trp Leu Cys Leu Val 

FIG. 30 
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TCG GGA CCT CCT TAT TAC GAA GGA j3TA GCG GTC GTG GGC ACT TAT ACC AAT CAT TCC AOaOO 
Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val Val Gly Thr Tyr Thr Asn His Ser Thr 

GCT CCG GCC AAC TGT ACG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCT GAA GTG ACA GO260 
Ala Pro Ala Asn Cys Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val . Thr Gly 

CAG GGC CTA TGC ATG GGG GCA GTA CCT AAA ACT CAC GAG GCC TTA TQX AAC ACC ACC CA820 
Gin Gly Leu Cys Met Gly Ala Val Pro Lys Thr His Gin Ala Leu Cys Asn Thr Thr Gin 

AGC GCC GGC TCA GGA TCC 'TAC TAC CTT GCA GCA CCC GCC GGA ACA ATG TGG GCT TGC AOO80 
ser Ala Gly Ser Gly Ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr Met Trp Ala Cys Ser 

ACT GGA TTG ACT CCC TGC TTG TCC ^CC ACG GTG CTC AAT CTA ACC ACA GAT TAT TGT GM40 
Thr Gly Leu Thr Pro Cys Leu Ser Thr Thr Val Leu Asn Leu Thr Thr Asp Tyr Cys Val 

TTA GTT GAA CTC TGG CCC AGA GTA ATT TAC CAC TCC CCC GAT TAT ATG TAT GGT CAG CTB0O 
Leu Val Glu Leu Trp Pro Arg Val lie Tyr His Ser Pro Asp Tyr Met Tyr Gly Gin Leu 

GAA CAG CGT ACC AAA TAT AAA AGA |3AG CCA GTA TCA TTG ACC CTG GCC CTT CTA CTA GO560 
Glu Gin Arg Thr Lys Tyr Lys Arg plu Pro Val Ser Leu Thr Leu Ala Leu Leu Leu Gly 

GGA TTA ACC ATG GGA GGG ATT GCA feCT GGA ATA GGG ACG GGG ACC ACT GCC TTA ATT AA2S20 
Gly Leu Thr Met Gly Gly lie Ala Ala Gly lie Gly Thr Gly Thr 'Tor Ala Leu He Lys 

ACC CAG CAG TTT GAG CAG CTT CAT fecc GCT ATC CAG ACA GAC CTC AAC GAA GTC GAA AAJ980 
Thr Gin Gin Phe Glu Gin Leu His Ala Ala lie Gin Thr Asp Leu Asn Glu Val Glu Lys 

TCA ATT ACC AAC CTA GAA AAG TCA CTG ACC TCG TTG TCT GAA GTA GTC CTA CAG AAC CO04O 

Ser ile Thr Asn Leu Glu Lys Ser Leu Thr Ser Leu Ser Glu Val Val Leu « lri Asn Arg 

i* - • 

AGA GGC CTA GAT TTG CTA TTC CTA AAG GAG GGA GGT CTC TGC GCA GCC CTA GAS 00 

Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala £eu Lys GjLu Gju 

TGT TCT TTT TAT GCA GAC ,CAC ACG GGG CTA GTG AGA GAC AGC ATG GCC AAA TT£ AGA GBS§0 
Cys Cys Phe Tyr Ala Asp His Thr Gly Leu Val Arg Asp Ser Met Ala Lys Leu Arg Glii 

AGG CTT AAT CAG AGA CAA AAA CTA }rTT GAG ACA GGC CAA GGA TGG TTC GAA GGG CTG TX920 
Arg Leu Asn Gin Arg Gin Lys Leu Iphe Glu Thr Gly Gin Gly Trp Phe Glu Gly Leu Phe ' 



AAT AGA TCC CCC TGG TTT ACC ACC TTA ATC 
Asn Arg Ser Pro Trp Phe Thr Thr lieu lie 

TTA CTG ATC TTA CTC TTT GGA CCT IpGC ATT 
Leu Leu lie Leu Leu Phe Gly Pro pys lie 



TCC ACC ATC ATG GGA CCT CTA ATA GTA C0&8O 
Ser Thr lie Met Gly Pro Leu lie Val Leu 

CTC AAT CGA TTG GTC CAA TTT GTT. AAA G804O 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp 



AGG ATC TCA GTG GTC CAG GCT CTG IsTT TTG ACT CAG CAA TAT CAC CAg CTA CCC ATA09 

Arg lie ser Val Val Gin Ala Leu pal Leu Thr Gin Gin Tyr His GIa Leu Lys Pro lie 

GAG TAC GAG CCA TGA 2115 
Glu Tyr Glu Pro . . . 

FIG. 30 (continued) 
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NEW POLYPEPTIDES AND THEIR USE FOR THE RESCUE OF FUSION 
DEFECTIVE VIRUS OR RETROVIRUS GLYCOPROTEINS 

The invention relates to new polypeptides, and their use for the rescue of fusion 
5 defective virus or retrovirus glycoproteins. 

Retroviruses enter cells following their attachment to specific cell surface receptors. 
This function is mediated by the envelope glycoproteins expressed as trimeric complexes on 
the viral particles (Hunter, 1997). Interaction with the receptor is thought to cause a 
conformational change of the glycoprotein structure necessary to expose a "fusion peptide", 

10 buried within the native envelope complex and involved in the actual membrane fusion 
process (Gerlier, 1998). On the basis of both genetic (Battini et al., 1995; Battini et al. f 
1992; Morgan et al., 1993; Ott and Rein, 1992) and structural (Fass et al ? , 1997; Fass et 
al. s 1996) evidence, the attachment and fusion functions of murine leukemia virus (MLV) 
envelopes have been separated on the two subunits of the envelope monomer. Thus it is 

15 generally considered that the amino-tenninal half of the MLV SU (surface) subunit is 
responsible for the binding to the receptor whereas the ectodomain of the TM 
(transmembrane) subunit contains the "fusion machinery" composed of an amino-terminal 
fusion peptide associated to a coiled-coil structure that rearrange upon receptor interaction. 
How retrovirus envelope glycoproteins convert binding to the receptor to a signal 

20 which activates the fusion machinery is currently unknown. Ample evidence from the 
literature indicate that functional inter-relations between different domains of the envelope 
complex are essential to promote fusion activation (Lavillette et al., 1998; Rein et al., 1998; 
Zhao et al., 1997; Zhao et al., 1998). 

Retroviruses can be involved in two different types of membrane fusion : 

25 - a fusion between viral and cell membranes triggered by the interaction of the virion 
envelope glycoproteins with specific cell surface molecules, which are the viral receptors 
(virus-cell fusion), with said fusion being responsible for retroviral infection, 
- a fusion between two cell membranes triggered by the interaction of the envelope 
glycoproteins expressed at the surface of a cell and the viral receptors expressed on a 

30 neighbouring cell (cell-cell fusion), the visible outcome of this latter being the formation of 
cell syncytia containing up to 100 nuclei (also known as polykaryocytes or multinucleated 
giant cells); syncytium-formation results in the death of the cells which make up the 
syncytium. 
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Furthermore, cleavage of the R peptide (inhibitory peptide on the cytoplasmic tail of 
the TM subunit of the envelope glycoprotein) is required to activate fully the ftisogenic 
potential of these envelope glycoproteins and mutants lacking the R peptide show greatly 
enhanced activity in cell fusion assays (Rein et al, J. Virol., 1994, 68, 1773-1781; Ragheb 
5 & Anderson, J. Virol., 1994, 68, 3220-3231; Brody et al, J. Virol., 1994, 68, 4620-4627). 

Furthermore, it is known that a vector directing the expression on a eukaryotic cell 
surface of a syncytium-inducing polypeptide can be used in gene therapy of malignant 
diseases, as a result of selective elimination of unwanted cells, through syncytia formation 
(WO 98/40492). 

10 The fusogenicity of envelope glycoproteins has been enhanced as described in 

Lavillette et al. (1998) and internal application WO 99/36561. The proline-rich region 
(PRR) of amphotropic MLV envelope glycoprotein was shown to be an essential 
determinant of membrane fusion activation and specific PRR-mutated envelope 
glycoproteins were obtained and were characterised by an increased cell-cell or virus-cell 

15 fusogenicity. 

Studies on Mo-MLV envelope protein GP70 have shown that the first eight amino 
acids of the mature gp70 protein are essential for fusion and virus entry. Within this region, 
a particularly important residue is H8. This residue belongs to the PHQ motif which is 
conserved among all MLVs and some other retroviruses, such as a gibbon ape leukemia 
20 virus SEATO and feline leukemia virus types A, B, and C. The amphotropic counterpart of 
Del2-8, which has the first six amino acids (Met-Ala-Glu-Ser-Pro-His) deleted has been 
constructed. Vectors bearing the deletion mutant envelope protein could not transduce NIH 
3T3 or HeLa cells (Bae et al., Journal of Virology, March 1997, p. 2092-2099). 

As the fusion mechanisms induced by interaction of retrovirus envelope 
25 glycoproteins and receptors of cell membrane are partially unknown, the regulation of the 
fusogenicity of retroviral or viral envelope glycoproteins was up to day not possible. 

One of the aims of the invention is to enable fusion control of envelope glycoproteins 
with target cells. 

It is an aim of the invention to provide means for the rescue of the fusion property of 
30 fusion defective viral or retroviral envelope glycoproteins. 
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It is another aim of the invention to provide a process for restoring the ftisogenic 
property of retroviral or viral envelope glycoproteins, the fiisogenic property of which has 
been inactivated and for controlling their activation by external means. 

It is another aim of the invention to provide a process in which envelope 
5 glycoproteins are in a first stage inactivated as to their fusion property and then in a second 
stage activated in a controlled and regulated way, by means of activating polypeptides, 
which possibly present targeting capacities, and which are able to restore efficiendy the 
fusogenic and cytotoxic properties of said envelope glycoproteins. 

It is another aim of the invention to provide polypeptides liable to activate the cell- 
10 cell fixsogenicity or the virus-cell fusogenicity. 

It is another aim of the invention to provide polypeptides liable to activate the virus- 
cell fusogenicity and which thus can be used in strategies involving retroviral vectors 
comprising envelope glycoproteins, the fusion property of which can be activated. 

It is another aim of the invention to provide ftisogenic envelope glycoproteins 
15 presenting cytotoxic properties for use in the treatment of malignant diseases, for which the 
fusogenic property could be controlled by means of activating polypeptides. 

It is another aim of the invention to provide cytotoxic fusogenic envelope 
glycoproteins having cellular specificity and for which the fusogenic property can be 
controlled by means of activating polypeptides. 
20 All these aspects are achieved through the present invention, which in one of its 

broad embodiments is the use of at least one activating polypeptide comprising an amino 
acid sequence corresponding to : 

- the amino acid sequence of the surface subunit (SU) of an envelope glycoprotein of a first 
virus, or of a first retrovirus, or 

25 - an amino acid sequence derived from said surface snbunit, or 

- a fragment of an amino acid sequence of said surface subunit, encompassing the receptor 
binding domain (RBD) and the amino terminal part of the surface subunit (SU), 

for the rescue of the fusion property of a fusion defective envelope glycoprotein, which has 
the property of attachment to target cells and is inactive with respect to the fusion with said 
30 target cells, said fusion defective envelope glycoprotein belonging to a second virus or a 
second retrovirus of the same type as or of a different type from the above designated "first 
virus or first retrovirus". 
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Thus the invention lies in the possibility of desactivating the inactivation of the 
fiisogenic property of an envelope glycoprotein, said envelope glycoprotein having 
previously been made inactive with respect to fusion. This restoration of the fusion property 
of the envelope glycoprotein is unexpectedly achieved by polypeptides whose common 
5 structure is the presence of the receptor binding domain (RBD) and the amino terminal part 
of SU in the environment of cells expressing fusion defective envelope glycoproteins and of 
cells expressing retroviral or viral receptors and the interaction of which will lead to the 
formation of syncytia, or in the environment of retroviruses incorporating fusion defective 
envelope glycoproteins and of cells expressing retroviral or viral receptors and the 

10 interaction of which will lead to infection. 

The expression "polypeptide comprising an amino acid sequence corresponding to", 
means that the polypeptide has the same amino sequence as that of the surface subunit, or as 
that of an aminoacid sequence derived from the surface subunit or of a fragment and can be 
prepared by recombinant technique or by chemical synthesis. 

15 The expression "polypeptide comprising an amino acid sequence derived from said 

surface subunit" means that the polypeptide has an aminoacid sequence which is not strictly 
identical to that of the sequence of the surface subunits and may contain mutations such as 
deletion or substitution of aminoacids of the parental sequence or insertion of aminoacid 
sequence of other origins. 

20 The "amino terminal part of the surface subunit" is defined as the aminoacid 

sequence of the first aminoacids of a length of about 10 to about 15 aminoacids of the 
envelope surface subunit, not including the aminoacid sequence of the signal peptide. 

A fusion defective envelope glycoprotein derives from a parental envelope 
glycoprotein in which the fusogenic property has been inactivated, but which still presents 

25 the property of recognizing (attachment) target cells. 

The expression "inactive with respect to the fusion" means that in a test for the measure of 
the fusogenic activity, the fusion index is either nul or significantly reduced when the fusion 
assay is allowed to proceed for about 20 minutes. 

The rescue of the fusion property can be measured according to one of the following 

30 tests : 

- test of formation of syncytia : expressing vectors encoding the envelope glycoprotein to be 
tested are transfected into cells on day 0. On day 2, the transfected cells are overlaid by 
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indicator cells which express the cell surface receptors corresponding to the tested envelope 
glycoprotein. Detection of syncytia (i.e. fusion of envelope-expressing cells with 
neighbouring indicator cells) is performed at day 3 or day 4 and fusion indexes are 
measured as previously described (Lavillette et al, 1998) by counting the number of 
5 syncytia and the number of nuclei outside and inside the syncytia. 

- test of infection of target cells : retroviral vectors carrying a lac Z transgene are being 
generated with the tested envelope glycoproteins. Serial dilutions of the vectors are applied 
on target cells at day 0. Infected cells are stained at day 2 with X-Gal to reveal the number 
of cell colonies having integrated the lac Z-vectors and titers are determined according to 

10 standard procedures (Lavillette at al., 1998). 

It is to be noted that the activating polypeptide corresponds to the amino acid 
sequence of the SU of an envelope glycoprotein of a first virus or retrovirus and that the 
fusion defective envelope glycoprotein belongs to a second virus or retrovirus of the same 
type or of a type different from the first virus or retrovirus. This means that there are in fact 

15 two possibilities : 

- either the first and second viruses are of the same type, which means that the activating 
polypeptide and the fusion defective envelope glycoprotein are homologous, 

- or the first and second viruses are of a different type, which means that the activating 
polypeptide and the fusion defective envelope glycoprotein are heterologous. 

20 According to an advantageous embodiment the fusion defective envelope 

glycoprotein involved are those derived from retroviruses. 

Retroviral envelope glycoproteins allow receptor binding of the viral particles onto 
which they have been incorporated during virion assembly. This preliminary attachment on 
the viral receptor activates a second function of the retroviral envelope glycoproteins which 

25 induces the fusion between the viral and the target cell membranes. While for type C 
mammalian retroviruses the determinants of receptor binding (RBD» receptor binding 
domain), are located at the amino-terminal end of the SU envelope subunit, the molecular 
determinants that are responsible of membrane fusion are located in the TM subunit of the 
glycoprotein. The molecular determinants or events which are involved in the conversion of 

30 binding to the receptor into a signal which activates the TM fusion determinants are not 
precisely known although by mutagenesis analysis, they are believed to be present in all the 
SU subunit, in particular in the N-terminal end of the RBD itself (Bae et al., 1997; 
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Lavillette et al., 2000), in the proline-rich region (Lavillette et al., 1998) and in the 
carboxy-terminal domain of the SU (Nussbaum et al., 1993; Pinter et al., 1997). Thus the 
control of the activation of the fusion machinery harbored by the TM subunit might be 
achieved by introducing specific mutations that affect the fusion-activation determinant of 
5 the SU. This invention describes several mutations in the SU which can inhibit or partially 
inhibit the SU determinants which control activation of the TM fusion-machinery. It is 
important to understand that such mutations do not impair the fusogenic potential of the 
mutant envelope glycoproteins, but only impair the molecular determinants that control the 
activation of the fusion machinery. The latter one is still intact, despite the mutations 

10 introduced in the SU, but is not activated upon receptor binding. In addition the invention 
describes the use of soluble polypeptides that can alleviate the effect of these mutations and 
hence couple the fusion machinery to a device that is provided exogenously in substitution 
to the normal fusion control determinants which are disrupted in the mutant envelope 
glycoproteins. Thus the addition of the "activating" polypeptides can restore or rescue the 

15 fusogenicity of the mutant envelope glycoproteins. 

Several mutations or modifications of the envelope glycoprotein can be used to 
reversibly inhibit or reduce their fusogenicity. 

A first method is the introduction of the delH mutation that removes an histidine 
from the PHQ conserved motif located at the ammo-terminal end of the SU envelope 

20 subunit of type C mammalian retroviruses such as MLV-A, MoMLV and GALV (mutations 
H5del, H8del or H9del, respectively). Such a mutation induces a drastic effect in the 
capacity of the receptor binding domain (RBD) of these envelope glycoproteins to triggers 
further events required to promote membrane fusion. 

A second method consists in the replacement of the SU C-terminal domain (G 

25 domain) with an heterologous C domain derived from the envelope glycoprotein of another 
type C mammalian retrovirus. For example; the chimeric CMO amphotropic glycoprotein 
(hereafter defined) can be constructed by combining the amphotropic MLV RBD and PRR 
(proline-rich region) with the ecotropic MoMLV C-terminal domain (example 4, Fig. 29A 
and 29B). For such chimeras, there is a low compatibility between their RBD (amphotropic 

30 origin) and C domain (ecotropic origin), which significantly impairs the kinetic of 
membrane fusion. 
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A third method consists in the replacement of the RBD by a ligand such* as a 
cytokine or a single chain antibody targeted to a cell surface specific receptor. In contrast to 
the RBD of the SU of a type C mammalian retrovirus, such ligands do not contain the 
determinants that allow further activation of the Env fusogenicity. Suck chimeric envelope 
5 glycoproteins therefore resembles a delH envelope glycoprotein which would carry an RBD 
that is targeted to a specific receptor. 

This invention relates to a method which can alleviate the inhibitory effect of these 
different mutations and thus restore the fusogenic properties of the mutant envelope 
glycoproteins. This method uses preparations of polypeptides which are derived from the 

10 RBD of different type C mammalian retrovirus envelope glycoproteins. For example, the 
addition of an RBD which is compatible with the C domain of a fusion-defective envelope 
glycoprotein will fully restore its fusogenic properties and will activate the kinetic of 
membrane fusion. Similarly such a peptide will also restore the fusogenicity of envelope 
glycoproteins that harbor a delH mutation or, alternatively, which carry a cell-type specific 

15 ligand in place of the RBD. 

This method can be applied to the activation of infectivity of retroviral vectors whose 
envelope glycoproteins carry one of these fusion-defective mutations. Depending on the 
presence of a ligand in the fusion-deficient envelope glycoprotein or, alternatively, in the 
activating polypeptide, control of infection to specific target cell types will therefore be 

20 obtained. The invention can also be applied to the control of the hyper-fusogenic activity of 
retroviral envelope glycoproteins that can induce cell-cell fusion and elimination of 
unwanted cells by syncytia formation. Indeed while most retroviral envelope glycoproteins 
are not truly fusogenic when expressed at the surface of the cells that produce the retroviral 
particles, they acquire their fusion competency during or after assembly and release of the 

25 viral particles. Yet there is considerable interest to convert the retroviral envelope 
glycoproteins into hyperfusogenic glycoproteins that can fuse cells in the close vicinity of 
the cells into which they are expressed. In addition to the introduction of the above- 
mentioned mutations that can control activation of their fusogenicity, the fusogenicity of 
retroviral envelope glycoproteins can be stimulated by several ways to obtain the best effect 

30 upon activation with an exogenous activating polypeptide. Thus, additional mutations will 
be introduced into the retroviral envelope glycoprotein, such as the truncation of the 
cytoplasmic tail (Rless mutation) and/or the modification of their proline-rich regions. 
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According to an advantageous embodiment, the activating polypeptide used in the 
invention recognizes substantially the same receptors of the target cells as the fusion 
defective envelope glycoprotein. 

According to another advantageous embodiment, the activating polypeptide used in 
5 the invention recognizes receptors of the target cells which are different from the ones 
recognized by the fusion defective envelope glycoprotein. 

It is thus quite unexpected that the activating polypeptides can restore the 
fusogenicity of fusion defective envelope glycoproteins by interacting with retroviral 
receptors which are expressed at the surface of the same target cells, and which are different 
10 from the receptors recognized by the fusion defective envelope glycoprotein. 

According to another advantageous embodiment of the invention, the activating 
polypeptide and/or the fusion defective envelope glycoprotein is (are) linked to a ligand, 
which is specific with respect to some receptors of the target cells, the ligand being 
preferably linked to the activating polypeptide. 
15 The addition of specific ligands on the activating polypeptides and/or on the fusion 

defective envelope glycoprotein enables to control the fusion activity of said fusion defective 
envelope glycoprotein and to increase the specificity with respect to target cells. Therefore, 
the design of a specific system to eliminate unwanted cells, may consist in a fusion defective 
envelope glycoprotein associated to an activating polypeptide that is linked to a ligand that 
20 can specifically recognize a receptor expressed on the target cells, whereas the activating 
polypeptide recognizes the same receptor or a different receptor expressed on the target 
cell. 

The link between the activating polypeptide and the ligand is preferably a covalent 
bond, whereby both the activating polypeptide and the ligand have been genetically fused 
25 and encoded by an expression vector as a single protein chain, and whereby the ligand is 
fused either to the amincHterminus or to the carboxy terminus of the activating polypeptide. 

According to another advantageous embodiment, the ligand is chosen among 
molecules liable to recognize receptors of target cells, for instance receptors of tumoral 
cells, such as EGF, FGF, VEGF, or liable to recognize epitope of proteins expressed by the 
30 target cells, such as recombinant monoclonal antibody. 

According to another advantageous embodiment, the activating polypeptide is under 
the form of a product as such, or is expressed by a host, such as a virus or a cell, or is 
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expressed, in particular by the target cells, transformed by nucleotide sequences coding for 
said activating polypeptide. 

In the use of the invention, one resorts either directly to chemical substances (which 
is the meaning of activating polypeptide under the form of a product as such), or to host 
5 (virus or cell) expressing said activating polypeptide. 

In the following of the text, the expression "soluble activating polypeptides" will be 
sometimes used, meaning that said polypeptides are not integrated in the envelope proteins 
of a virus. 

The capacity of the activating polypeptides to rescue the infectivity of fusion 
10 defective envelope glycoproteins is similar whether the target cells i) constitutively express 
and secrete the activating polypeptides, ii) are co-incubated with both fusion defective 
envelope glycoprotein and said activating polypeptides iii) are pre-incubated with said 
activating polypeptides before adding the fusion defective envelope glycoprotein, or iv) are 
pre-incubated with the fusion defective envelope glycoprotein before adding the activating 
15 polypeptides. 

In an advantageous use of the invention the fusion defective envelope glycoprotein is 
expressed by a host, such as a virus incorporating in its envelope said fusion defective 
envelope glycoprotein, or such as a cell, transformed by nucleotide sequences coding for 
said fusion defective envelope glycoprotein. 
20 According to the invention, the activating polypeptide advantageously corresponds 

to a fragment of about 200 to about 250 amino acids of said surface subunit comprising the 
receptor binding domain (RBD) and comprising the first amino acids of the amino terminal 
part of the surface subunit, the number of said first aminoacids varying from about 10 to 
about 15, in particular comprising the RBD and the PHQ peptide, in particular the 
25 fragments corresponding to amino acid sequences derived from type C mammalian 
retroviruses such as: 
- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 

30 . xenotropic 

. polytropic 
, 10 Al MLV 
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- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B, 
FELV-C 

in particular wherein the activating polypeptide corresponds to one of the following amino 
acid sequences : 

5 

A-RBD 
E-RBD 
GALV-RBD 
34-1-A-RBD 
10 34-1-E-RBD 

34-1-GALV-RBD 
EGF-A-RBD 
EGF-E-RBD 
EGF-GALV-RBD 

15 - A-RBD corresponds to the activating polypeptide comprising the RBD domain and the 
PHQV motif of the amphotropic MLV surface subunit. 

- E-RBD corresponds to the activating polypeptide comprising the RBD domain and the 
PHQV motif of the ecotropic MLV surface subunit. 

- GALV-RBD corresponds to the activating polypeptide comprising the RBD domain and 
20 the PHQP motif of the GALV surface subunit. 

- 34.1-A-RBD corresponds to the activating polypeptide A-RBD above defined, linked to a 
ligand which is a single chain antibody directed against MHCL 

- 34.1-E-RBD corresponds to the activating polypeptide E-RBD above defined, linked to a 
ligand which is a single chain antibody directed against MHCL 

25 - 34.1 GALV-RBD corresponds to the activating polypeptide GALV-RBD above defined, 
linked to a ligand which is a single chain antibody directed against MHCL 

- EGF-A-RBD corresponds to the activating polypeptide A-RBD above defined, linked to a 
ligand which is the epidermal growth factor. 

- EFG-E-RBD corresponds to the activating polypeptide E-RBD above defined, linked to a 
30 ligand which is the epidermal growth factor. 

- EGF-GALV-RBD corresponds to the activating polypeptide GALV-RBD above defined, 
linked to a ligand which is the epidermal growth factor. 

10 
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According to the invention, the fusion defective envelope glycoprotein 
advantageously corresponds to the amino acid sequence of a mutated form of an envelope 
glycoprotein of a virus or a retrovirus which, in its non mutated form, presents attachment 
and fusion properties with respect to target cells, said fusion defective envelope 
5 glycoprotein comprising one or several mutations in the amino terminal part of its surface 
subunit, particularly in the first amino acids of the amino terminal part of the surface 
subunit, the number of said first aminoacids varying from about 10 to about 15, said 
mutations inactivating the fusion property without substantially affecting the attachment 
property, in particular comprises one or several mutations within the PHQ amino sequence, ' 
10 wherein in particular the fusion defective envelope glycoprotein is derived from type C 
mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 

15 . xenotropic 

. polytropic 
.10A1MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B, 
FELV-C, 

20 in particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino acid sequences : 

- MO del H 

- AdelH 

- PROMOdelH 
25 - GAL VdelH 

-C2delH 

- MO del H R less 

- AdelHRless 

- PROMOdelHRless 
30 - GALVdelHRless 

- C2delHRless 
-CMO 
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-TM MO 

- C TM MO 

- C MOdelH 

- TM MOdelH 

5 - C TM MOdelh 
-34.1-A 

* MO del H corresponds to the ecotropic MLV envelope glycoprotein, in which the 
histidine residue of the PHQV motif is deleted (i.e. the 8th residue of the SU subunit is 

10 removed). 

* AdelH corresponds to the amphotropic MLV envelope glycoprotein, in which the histidine 
residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* PROMOdelH corresponds to the amphotropic MLV envelope glycoprotein, in which the 
proline rich region has been replaced by the proline rich region of the ecotropic • MLV 

15 envelope and in which the histidine residue of the PHQV motif is deleted. 

* GALVdelH corresponds to the GALV envelope glycoprotein, (gibbon ape leukemia virus 
envelope glycoprotein), in which the histidine residue of the PHQP motif is deleted (i.e. the 
9th residue of the SU subunit is removed). 

* C2delH corresponds to the amphotropic MLV envelope glycoprotein, in which the proline 
20 rich region has been modified by a small deletion at its carboxy terminus and in which the 

histidine residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is 
removed). 

* MO del H R less corresponds to the ecotropic MLV envelope glycoprotein, in which the 
histidine residue of the PHQV motif is deleted and in which the R peptide of the TM 

25 subunit has been deleted. 

* AdelHRless corresponds to the amphotropic MLV envelope glycoprotein in which the 
histidine residue of the PHQV motif is deleted and in which the R peptide of the TM 
subunit has been deleted. 

* PROMOdelHRless corresponds to the amphotropic MLV envelope glycoprotein, in which 
30 the proline rich region has been replaced by the proline rich region of the ecotropic MLV 

envelope, in which the histidine residue of the PHQV motif is deleted and in which the R 
peptide of the TM subunit has been deleted. 
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* GALVdelHRless corresponds to the GALV envelope glycoprotein, in which the histidine 
residue of the PHQP motif is deleted and in which the R peptide of the TM subunit has been 
deleted. 

* C2delHRless corresponds to the amphotropic MLV envelope glycoprotein, in which the 
5 proline rich region has been modified by a small deletion at its carboxy terminus and in 

which the histidine residue of the PHQV motif is deleted (i.e. the 5th residue of the SU 
subunit is removed) and in which the R peptide of the TM subunit has been deleted. 

* CMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which the 
SU carboxy-terminal domain, C domain, has been replaced by the C domain derived from 

10 the ecotropic MoMLV envelope glycoprotein. 

* CMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which the SU carboxy-terminal domain, C domain, has been replaced by the C domain 
derived from the ecotropic MoMLV envelope glycoprotein and in which the histidine 
residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

15 * TMMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
the ectodomain of the TM subunit has been replaced by the TM ectodomain derived from 
the ecotropic MoMLV envelope glycoprotein. 

* TMMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which the ectodomain of the TM subunit has been replaced by the TM ectodomain derived 

20 from the ecotropic MoMLV envelope glycoprotein and in which the histidine residue of the 
PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* CTMMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
both the SU carboxy-terminal domain, C domain, and the ectodomain of the TM subunit 
have been replaced by the C domain and TM ectodomain derived from the ecotropic 

25 MoMLV envelope glycoprotein. 

* CTMMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which both the SU carboxy-terminal domain, C domain, and the ectodomain of the TM 
subunit have been replaced by the C domain and TM ectodomain derived from the ecotropic 
MoMLV envelope glycoprotein and in which the histidine residue of the PHQV motif is 

30 deleted (i.e. the 5th residue of the SU subunit is removed). 



:WO 0071578A3_IA> 



WO 00/071578 



PCT/EP00/04534 



* 34.1-A envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
the receptor binding domain has been removed and replaced by an single chain antibody 
against human class I MHC molecules. 

5 The invention also relates to the use of at least one activating polypeptide and of at 

least one fusion defective envelope glycoprotein for the preparation of a drag useful for the 
treatment of cancer and infectious diseases. 

The invention also relates to products containing an activating polypeptide and a 
fusion defective envelope glycoprotein as defined above, as a combined preparation for 
10 simultaneous, separate, or sequential use for the rescue of the fusion defective property of 
said fusion defective envelope glycoprotein. 

The invention also relates to products containing an activating polypeptide and a 
fusion defective envelope glycoprotein as defined above, as a combined preparation for 
simultaneous, separate or sequential use for the treatment of cancer pathologies. 
15 The invention also relates, as new products, to activating polypeptides corresponding 

to a fragment of about 200 to about 250 amino acids of said surface subunit comprising the 
RBD and the PHQV peptide, in particular the fragments corresponding to amino acid 
sequences derived from type C mammalian retroviruses such as: 
- Murine leukemia virus such as : 
20 . amphotropic 

. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

25 - or GALV type retroviruses and related retrovirus such as SSAV, FELV-B 

in particular wherein the activating polypeptide corresponds to one of the following amino 

acid sequences : 

A-RBD 

E-RBD 
30 GALV-RBD 

34-1-A-RBD 

34-1-E-RBD 
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34-1-GALV-RBD 
EGF-A-RBD . 
EGF-E-RBD 
EGF-GALV-RBD 

5 The invention also relates, as new products, to fusion defective envelope 

glycoprotein to the amino sequence of an envelope glycoprotein of a virus or a retrovirus 
which in its native state presents attachment and fusion properties with respect to target 
cells, and comprises one or several mutations in the amino terminal part of its surface 
subunit, inactivating the fusion property without substantially affecting the attachment 
10 property, in particular comprises one or several mutations within the PHQ amino sequence 
of the amino terminal part of its surface subunit, wherein in particular the fusion defective 
envelope glycoprotein is derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
15 . ecotropic 

. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B or 
20 FELV-C 

in particular wherein the fusion defective envelope glycoprotein corresponds to one Of the 
following amino acid sequences : 

- AdelH 

- PROMOdelH 
25 - GALVdelH 

- C2delH 

- AdelHRless 

- PROMOdelHRless 

- GALVdelHRless 
30 - C2delHRless 

-CMO 
-TMMO 

15 
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- C TM MO 

- C MOdelH 

- TM MOdelH 

- C TM MOdelh 
-34.1-A 

with the proviso that they are different from the ecotropic MLV sequence, wherein the 
amino acids in position 2 to 8 are either substituted or deleted. 

The invention also relates to the use of at least one activating polypeptide to the 
rescue of the fusion property of a fusion defective envelope glycoprotein wherein the fusion 
defective envelope glycoprotein corresponds to the amino acid sequence of a mutated form 
of an envelope glycoprotein of a virus or a retrovirus which, in its non mutated form, 
presents attachment and fusion properties with respect to target cells, said fusion defective 
envelope glycoprotein being advantagously derived from type C mammalian retroviruses 
such as: 

- Murine leukemia virus such as : 

. amphotropic 
.. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV- 
B, FELV-C and comprising : 

* one or several mutations in the amino terminal part of its surface subunit, 
particulary in the first amino acids of the amino terminal part of the surface subunit, the 
number of said first aminoacids varying from about 10 to about 15, said mutations 
inactivating the fusion property without substantially affecting the attachment property, in 
particular comprises one or several mutations within the PHQ amino-terminal sequence, 
wherein in particular the fusion defective envelope glycoprotein corresponds to one of the 
following amino acid sequences : 

- MO R less del H 

- MO del H 
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- A R less del H 

- AdelH 

- PROMO R less del H 

- PROMOdelH 

5 - GALV R less del H 

- GALVdelH 

- C2 R less del H 

- C2delH 

- C MOdelH 
10 - TM MOdelH 

- C TM MOdelh 

* or the replacement of the ammo-terminal domain of its surface subunit (RBD) by a 
ligand such as a growth factor, or a cytokine or a single chain antibody, the extent of said 

15 mutations inactivating the fusion property and the attachement property to the normal viral 
receptor, in particular wherein the fusion defective envelope glycoprotein corresponds to the 
following amino-acid sequence : 
-34.1-A 

* or the replacement of its SU carboxy-terminal domain and / or its TM ectodomain, 
and preferably its SU carboxy-terminal domain, by the corresponding SU carboxy-terminal 
domain and / or the TM ectodomain of another "second" glycoprotein derived from type C 
mammalian retroviruses, with the proviso that said fusion defective glycoprotein and said 
second glycoprotein are not derived from the same type C mammalian retroviruses, in 
particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino-acid sequences : 

- CMO 

- TM MO 

- C TM MO 
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FIGURE LEGENDS 

Fig 1A, IB and 1C represent titres of AdelH retroviruses in the presence of soluble 
envelope fragments of the invention. Retroviral vectors carrying VSV-G or/and wild-type 
5 amphotropic, A, envelope glycoproteins were used as controls. Titres expressed as lacZ 
infectious units (i.u.) per ml of viral supernatants. In figures 1A, IB and 1C, the titres are 
on the y-axis. 

Fig 1A. Different target cell types (XC, NIH3T3, CHO-PIT-2, TE 671, from left to right) 
were incubated with both lacZ retroviruses and conditioned media containing, or not 
10 (control), the indicated polypeptides during the 3 hrs of infection. The columns with vertical 
lines correspond to the control, the columns with horizontal lines correspond to A-RBD and 
the hatched columns correspond to SU polypeptide. 

Fig IB, XC target cells were incubated with dilutions of A-RBD (x axis) and infected with 
LacZ retroviruses carrying the AdelH envelope glycoproteins (curve with black circles), A 
15 envelope glycoproteins (curve with black squares) or VSV-G envelope glycoproteins (curve 
with hollow squares) . 

Fig 1C. Retroviral vectors carrying A envelope glycoproteins (curves with black squares) 
or AdelH envelope glycoproteins (curves with black circles) were incubated with XG target 
cells for 1 hr at 4°C to allow virion binding while preventing cell entry. After PBS washing 

20 to remove unbound retroviruses, cells were incubated at 37°C with undiluted A-RBD- 
containing supernatants for the indicated times of dilution represented on the x axis, the 
titres being expressed on the y axis. A-RBD was then eliminated from the cell supernatant 
by washing the cells four times with 1 ml PBS (resulting in dilution of unbound A-RBD by 
more than 100,000 times). Cells were then grown in regular medium for 2 days before X- 

25 Gal staining. 

Fig 2. represents the titers of A-RBD-mediated activation of AdelH retroviruses. A- 
RBD (used undiluted) was bound to XC target cells at 37 °C for 30 min. Unbound A-RBD 
was removed and cells were washed two times with PBS and incubated at 37 °C. At the 
30 indicated times, cells were further washed two times with PBS and incubated at 37°C for 3 
hrs with retroviruses carrying A envelope glycoproteins or AdelH envelope glycoproteins. 
Titers, determined 2 days later by X-Gal staining, are expressed as lacZ i.u./ml (y-axis), as 
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a function of time before infection (x-axis). The curve with black circles corresponds to 
retrovirus carrying A envelope glycoproteins and the curve with black squares corresponds 
to retrovirus carrying AdelH envelope glycoproteins. 

5 Fig 3. represents titers of AdelH retroviruses after ultrafiltration. Titers expressed on 
the y-axis (as lacZ Lu./ml) of retroviruses carrying wild-type, A, or AdelH envelope 
glycoproteins before (1) or after incubation with A-RBD (2), and following A-RBD elution 
on two successive 700 KD-cut off ultrafiltration cartridges (3). After elution AdelH 
retroviruses were subjected to re-stimulation with fresh A-RBD (4). 

10 

Fig. 4A and 4B represent infection assays of AdelH and MOdelH retroviruses in the 
presence of RBD or RBDdelH polypeptides. 

Fig. 4A. Retroviruses carrying A (upper left) or AdelH (upper right) envelope 
glycoproteins were mixed with A-RBD (hatched column) or A-RBDdelH (column with 
15 inverse hatched lines) during infection of XC cells. Retroviruses carrying MO (lower left) 
or MOdelH (lower right) envelope glycoproteins were mixed with E-RBD (hatched column) 
or E-RBDdelH (column with inverse hatched lines) during infection of XC cells. Titers are 
expressed as lacZ i.u./ml on the y-axis. In Figure 4A, the black column corresponds to 
control. 

20 Fig, 4B. represents binding assays of A-RBD (solid lines) and A-RBDdelH fragments 
(broken lines) on Cearl3 (top), and binding assays of E-RBD (solid lines) and E-RBDdelH 
fragments (broken lines) on Cearl3 cells (bottom). The background of fluorescence 
(hatched histograms) was provided by incubating the cells with supernatants devoid of 
envelope fragments. 

25 Fig 5A and SB represent the cross-activation of AdelH and MOdelH retroviruses by 
MLVRBDs. 

Fig. 5A. Titers of retroviruses carrying AdelH or MOdelH envelope glycoproteins in the 
presence of the indicated RBDs. Titers are determined (y-axis) on XC target cells as lacZ 
i.u./ml. On figure 5A, the column with vertical lines corresponds to control, the hatched 
30 column corresponds to A-RBD and the column with inverse hatched lines corresponds to E- 
RBD. 

J; 
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Fig. SB, Titers of AdelH retroviruses (y-axis) on CHO cells expressing, or not (control), 

PiT-2 and/or mCAT-1 MLV receptors in the presence of the indicated RBDs. On figure 5B, 

the hatched column corresponds to E-RBD, the column with inverse hatched lines 

corresponds to A-RBD and the column with vertical lines corresponds to control. 
5 Fig. 6 represents the A-RBD sequence 

Fig. 7 represents the E-RBD sequence 

Fig. 8 represents the GALV-RBD sequence 

Fig. 9 represents the EGF-A-RBD sequence 

Fig. 10 represents the EGF-E-RBD sequence 
10 Fig. 11 represents the EGF-GALV-RBD sequence 

Fig. 12 represents the 34.1-A-RBD sequence 

Fig. 13 represents the 34.1-E-RBD sequence 

Fig. 14 represents the 34. 1-GALV-RBD sequence 

Fig. 15 represents the AdelH sequence 
15 Fig. 16 represents the PROMOdelH sequence 

Fig. 17 represents the GALVdelH sequence 

Fig. 18 represents the C2delH sequence 

Fig. 19 represents the AdelHRless sequence 

Fig. 20 represents the PROMOdelHRless sequence 
20 Fig. 21 represents the GALVdelHRless sequence 

Fig. 22 represents the C2delHRless sequence 

Fig. 23 represents the C MO sequence 

Fig. 24 represents the C MO delH sequence 

Fig. 25 represents the TM MO sequence 
25 Fig. 26 represents the TM MOdelH sequence 

Fig. 27 represents the C TM MO sequence 

Fig. 28 represents the C TM MO delH sequence 

Fig. 29A represents the schematic structure of chimeric amphotropic envelope glycoproteins 
in which the C domain (CMO chimera), the TM ectodomain (TMMO chimera) or both C 
30 and TM domains (CTMMO chimera) are replaced by the ecotropic counterpart derived 
from MoMLV envelope glycoprotein. The CMOdelH, TMMOdelH and CTMMOdelH 
chimeric envelope glycoproteins harbor the H5del mutation. 
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Figure 29B represents the kinetic of infection of retroviral vectors carrying the wild-type 
amphotropic envelope glycoprotein (A) or the CTMMO araphotropic chimera on XC cells 
in the presence, or not, of soluble polypeptides encompassing the amphotropic RBD (A- 
RBD) or the ecotropic RBD (E-RBD), as indicated. 
5 The curve with points and black circles corresponds CTMMO on XC cells + E-RBD 
The curve with points and hollow circles corresponds to A on XC cells + E-RBD 
The curve with continuous line and hollow circles corresponds to A on XC cells 
The curve with dotted line and hollow circles corresponds to A on XC cells + A-RBD 
The curve with continuous line and black circles corresponds to CTMMO on XC cells 
10 The curve with dotted line and black circles corresponds to CTMMO on XC cells + A- 
RBD 

The X axis corresponds to the time expressed in seconds. 
The Y axis corresponds to the number of infected cells. 
Fig, 30 represents the 34.1-A Sequence 

15 

Example 1: 

MATERIALS AND METHODS 

Cell lines. The TELCeB6 cell line (Cosset et ah, 1995b) has been derived from the TELac2 
20 line after transfection and clonal selection of a Moloney murine leukemia virus (MoMLV)- 
based expression plasmid to produce Gag and Pol proteins. The TELac2 cells were 
originally derived from the TE671 human rhabdomyosarcoma cells (ATCC CRL8805) to 
express the nlsLacZ reporter retroviral vector (Takeuchi et aL, 1994). Production of 
infectious retroviral particles by TELCeB6 cells depends on newly introduced envelope 
25 expression vectors. 

Cerd9 and Cearl3 cells (Kozak et aL, 1995) and CHO-PiT-2 cells (Rodrigues and Heard, 
1999) were derived from CHO (Chinese hamster ovary) cells (ATCC CCL-61) and express 
ecotropic MLV receptors alone or both ecotropic and amphotropic receptors or amphotropic 
MLV receptors alone, respectively. 
30 XC-A-RBD cells were derived from XC rat sarcoma cells (ATCC CCL-165) after 
transfection with the pA-ST plasmid expressing the amino terminal receptor binding domain 
of the amphotropic envelope glycoprotein (Battini et aL , 1996). 
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TE671 and TELCeB6 cells were grown in DMEM (Life-Technologies) supplemented with 
10% fetal bovine serum. NIH-3T3 mouse fibroblasts were grown in DMEM supplemented 
with 10% new-born calf serum. Cerd9, Cearl3, CHO-PiT-2 and CHO cells were grown in 
DMEM supplemented with 10% fetal bovine serum and with proline (Life-Technologies). 

Construction of envelope expression vectors. Plasmids FBASALF and FBMOSALF 
carrying a phleomycin-resistance gene and encoding the MLV-4070A amphotropic (noted as 
A) and MoMLV ecotropic (noted as MO) envelope glycoproteins, respectively, have been 
described elsewhere (Cosset et al., 1995a) and were used as backbones for construction and 
expression of envelope mutants. All constructs were generated by PCR-mediated and 
oligonucleotide site-directed mutagenesis (details and sequences available upon request). 
The FBASALF plasmid was modified to produce a cell entry-defective form of the 
amphotropic glycoprotein, designated AdelH envelope, by deleting the 36th codon of the 
4070A env gene (Ott et al., 1990). The resulting mutant envelope glycoprotein, in which 
the 5th residue of the SU envelope subunit was removed, was named AdelH. The 
expression plasmid encoding the fusion-defective MOdelH envelope glycoproteins, 
harboring a deletion of the 8th residue of the SU envelope subunit corresponding to the 41th 
codon of the MoMLV env gene (Shinnick et al., 1981), was derived from FBMOSALF. 
Plasmids encoding soluble receptor binding domains (RBD) were derived from FBASALF 
and FBMOSALF expression vectors. The boundaries of either amphotropic (A-RBD) and 
ecotropic (E-RBD) RBDs, defined as A32244 and A34269, were fused in frame with the 
carboxy-tenninal ends of either envelope glycoproteins by fusion to aminoacids R623 or 
R634, respectively. Residues are numbered starting from the initiation methionine deduced 
from the aminoacid sequences of the 4070A amphotropic MLV (Ott et al., 1990) or the 
Moloney MLV (Shinnick et al., 1981). Expression vectors encoding either A-RBDdelH or 
E-RBDdelH were generated similarly by using the plasmids expressing the AdelH or 
MOdelH envelope glycoproteins. In plasmids encoding A-RBD, E-RBD, A-RBDdelH or E- 
RBDdelH polypeptides, the 11 aminoacid-long VSV tag (YTDIEMNRLGK) (Kreis and 
Lodish, 1986) was inserted after either RBDs. 

Transfections, production of RBDs and infection assays. Envelope glycoproteins 
expression plasmids were transfected into TELCeB6 as reported elsewhere (Cosset et al., 
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1995a). RBD expression vectors were transfected by calcium phosphate precipitation in 
NEH-3T3, XC or TE671 cells. Transfected cells were selected with phleomycin (50 |ag/ml) 
and phleomycin-resistant colonies were pooled. 

RBD-containing supernatants were collected after an overnight production from confluent 
5 RBD-transfected cells, filtrated trough 0.45 micron pore-sized membranes and stored at 
4°C. Virus-containing supernatants were collected after an overnight production from 
freshly confluent env-transfected TELCeB6 cells in regular medium. 

The supernatant were filtered trough 0.45 micron pore-sized membranes and diluted in 
DMEM for titration assays. Target cells were seeded in 24 multi-well plates at a density of. 

10 5x10* cells per well and incubated overnight at 37°C. Unless otherwise indicated in figure 
legends, 200 p\ of conditioned cell culture media containing the RBDs were added to the 
cells after removing their supernatants. Then 200 jal of viral supernatant dilutions containing 
5 jLLg/ml polybrene were added and cells were incubated for 3 hrs at 37 °C. Cell supernatants 
were then removed and cells were incubated in regular medium for 48 hrs. X-Gal staining 

15 and viral titer determination were performed as previously described (Cosset et al., 1995a) 
and expressed as lacZ i.u. (infectious units)/ml. 

Antibodies. Anti-SU: a rat monoclonal antibody 83A25 (Evans et al., 1990) cell culture 
supernatant against MLV SU used undiluted for FACS analysis. Anti-VSV tag: a purified 
20 mouse monoclonal antibody P5D4 (SIGMA-Aldrich) used diluted 1/100 for FACS analysis. 

Binding assays and FACS analysis. For binding assays, target cells were washed in PBS 
and detached by a 10 min incubation at 37°C with versene 0.02% in PBS. Cells were 
washed in PBA (PBS with 2% FCS and 0.1% sodium azide). 5xl0 5 cells were incubated 

25 with 1 ml of conditioned supernatants containing the VSV tagged RBDs for 45 min at 37 °C. 
Cells were then washed with PBA and were incubated with the P5D4 antibody, for 45 min 
at 4°C. Cells were washed twice with PBA and incubated with anti-mouse FITC-conjugated 
antibodies (DAKO, U.K.), respectively, 5 min before the two final washes in PBA, cells 
were counter-stained with 20 |a,g/ml propidium iodide. Fluorescence of living cells was 

30 analysed with a fluorescent-activated cell sorter (FACSCalibur, Beckton Dickinson). 

RESULTS 
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HSdcl phenotype can be efficiently compensated by soluble A-RBD. Viral particles were 
generated with either wild-type amphotropic A or AdelH envelope glycoproteins and used to 
infect target cells in the presence or in the absence of soluble SU or soluble A-RBD, an 
envelope fragment encompassing the amphotropic receptor binding domain. When SU or A- 
5 RBD were provided during infection the infectivity of retroviruses carrying wild-type 
amphotropic envelope glycoproteins was decreased by approximately 5-100 times (Fig. 1A), 
most likely owing to the partial blocking of PiT-2 receptors on the target cell surface. In 
striking contrast, while in the absence of envelope fragments retroviruses generated with 
AdelH envelopes were very poorly infectious, with titers in the range of lO^lO 2 lacZ i.u. 

10 (infectious units)/ml, the presence of SU or of A-RBD in the medium of infected cells could 
strongly stimulate the infectivity of the former retroviruses by up to 30,000 fold (Fig. 1A). 
Stimulation of AdelH infection with either soluble envelope fragments was detected on a 
wide range of target cell types including rat XC cells, 3T3 murine fibroblasts, TE671 
human cells and PiT-2-transfected CHO hamster cells, though with different efficacies (Fig. 

15 1A). Significant differences were detected when soluble SU or A-RBD were used to activate 
the AdelH retroviruses. Comparatively to soluble SU, A-RBD was 100 to 1,000 fold more 
potent to activate AdelH retroviruses (Fig. 1A). These results indicated that the 
determinants of activation were located in the first half of the SU protein. Therefore to 
facilitate the characterization of this activation pathway, the subsequent experiments were 

20 performed by using the soluble RBD. A-RBD could fully activate AdelH retroviruses when 
diluted 10 fold and retained 90% of activity at a 1:100 dilution, thus suggesting that it was 
effective at very low doses (Fig. IB). In contrast the infectivity of control retroviruses 
pseudotyped with either wild-type amphotropic envelopes or with VSV-G glycoproteins was 
weakly increased or unchanged, respectively, when A-RBD was diluted (Fig. IB). 

25 Additionally the origin of the cell types used to produce the envelope fragments did not 
influence the stimulation of infectivity since conditioned media harvested from 3T3, TE671 
or XC cells that were engineered to express the envelope fragments could similarly activate 
the infectivity of AdelH retroviruses (data not shown). Finally activation of AdelH 
retroviruses was found to be the result of a specific interaction of A-RBD with the PiT-2 

30 amphotropic receptor. Indeed while A-RBD could efficiently rescue the infectivity of AdelH 
retroviruses on PiT-2-transfected CHO cells, no infection could be detected when using 
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parental CHO cells as target cells (see below and Fig. 5A). Altogether these data 
demonstrated that activation of AdelH retroviruses by A-RBD was specific. 
The capacity of the soluble envelope fragments to rescue the infectivity of AdelH 
retroviruses was similar whether the target cells i) constitutively expressed and secreted the 
5 envelope fragments, ii) were co-incubated with both AdelH retroviruses and A-RBD, hi) 
were pre-incubated with A-RBD prior to infection, or iv) were pre-incubated with the 
AdelH retroviruses before adding A-RBD (Table 1). Using this latter experimental 
condition, the kinetic of activation of infection was found very rapid. Indeed a brief 
incubation of target cells with A-RBD, by no more than 30 sec, was found sufficient to fully 

10 stimulate the infectivity of pre-bound AdelH retroviruses (Fig. 1Q. 

A-RBD stimulates the target cell membrane rather than the AdelH envelope. In theory, 
it is possible that the envelope fragments might rescue the infectivity of AdelH retroviruses 
either by interacting with the fusion-defective viral particles or by activating the target cell 
membrane upon receptor binding. Indeed within the envelope complex, the SU subunits are 

15 not tightly attached to either the TM subunits or to the other SU units and can easily shed 
off the viral particles (Hunter and Swanstrom, 1990). However the reassociation of shed SU 
with the viral particle is unlikely since they are incorporated on virions by virtue of their 
association with the TM subunit that results from their synthesis as a common SU-TM 
polypeptide precursor (Hunter and Swanstrom, 1990). Therefore it is unlikely that soluble 

20 SU or A-RBD mixed with AdelH retroviruses may interact with the envelope complex of 
the latter retroviruses and thus rescue their fusion defect. Nevertheless the following 
experiments were performed to address this possibility. 

A-RBD was bound to target cells at 37 °C for 30 min. After removal of unbound fragments 
by washing target cells, binding of A-RBD was verified by FACS analysis. Cells were then 

25 incubated at 37°C for 0 (TO) to 28 hrs (T28) to allow internalisation of receptor/A-RBD 
complexes (Rodrigues and Heard, 1999) and A-RBD disappearance from the cell surface, as 
checked by FACS analysis (Data not shown). Cells were then washed again to remove A- 
RBD that may have been released by the cells and the AdelH retroviruses were then added 
at the different time points. Comparatively to infection performed at TO, no significant 

30 decrease of infectivity could be detected when the AdelH retroviruses were added until 8 
hrs after the initial binding with A-RBD. Likewise the infectivity of AdelH retroviruses 
added at T20 was at least 50 fold higher than before pre-stimulation of the cells with A- 
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RBD, thus indicating that stimulation of infectivity was durable (Fig. 2). Since A-RBD was 
efficiently internalised and removed from the cell surface after 4 hrs of incubation (Data not 
shown), these results indicated that activation of AdelH retroviruses most likely occurred by 
activation of the cell surface rather than by interaction with the AdelH envelope complex 
itself. 

Further evidence indicated that A-RBD could not associate with the AdelH retroviruses. 
Indeed retroviruses generated with either A or AdelH envelopes and mixed or co-expressed 
with A-RBD were not found to incorporate A-RBD as shown by western-blotting of purified 
viral particles with antibodies recognising A-RBD (data not shown). However these 
experiments could not exclude that undetected quantities of A-RBD might be associated with 
the virions. Therefore to rule out this possibility, a mixture of A-RBD and retroviruses 
carrying either AdelH or wild-type A envelope glycoproteins were separated on two 
consecutive 700 KD-pore sized ultrafiltration columns in order to elute out A-RBD. No 
decrease of infectivity could be detected for retroviruses carrying wild-type amphotropic 
envelopes after the two subsequent ultrafiltrations (Fig. 3), demonstrating that this process 
did not affect the viability of the retroviruses. However while before ultrafiltration the 
AdelH viral particles mixed with A-RBD could efficiently infect XC target cells, as 
expected, only a residual infectivity could be detected after A-RBD elution (Fig. 3), 
indicating that stimulation of AdelH retroviruses was lost after separation from A-RBD. As 
control, the AdelH retroviruses processed through the two columns retained their full 
capacity to be stimulated by newly added A-RBD (Fig. 3). Similar experiments were 
conducted after separation of AdelH retroviruses from A-RBD by sephacryl 
chromatography and led to the same conclusions (data not shown). 

Altogether these data indicated that stimulation of infectivity of the AdelH retroviruses 
proceeded via activation of the target cell membrane, most likely as a consequence of 
interaction of A-RBD with the PiT-2 amphotropic receptor, rather than via interaction with 
the AdelH envelope incorporated on the retrovirus itself. Thus A-RBD could activate the 
target cell membrane which in turn became competent to allow entry of retroviruses 
carrying the fusion-defective AdelH envelope glycoproteins. 

The SU ammo-terminal end activates a cell entry pathway common to type C 
mammalian retroviruses. The histidine residue deleted in the H5del mutant glycoprotein 
belongs to a peptide motif, PHQ, found at the ammo-termini of the SUs of all type C 
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mammalian retroviruses, suggesting a conserved function of this motif. Thus the H8del 
mutant ecotropic envelope glycoprotein (MOdelH), harboring the deletion of the histidine in 
the PHQV peptide motif of the MoMLV SU, is impaired for both cell-cell and virus-cell 
fusion (Bae et al., 1997). Similarly the H9del mutation of the GALV envelope glycoprotein 
5 GAdelH mutant results in impairement of both cell-cell and virus-cell fusion. Further 
experiments were performed to demonstrate that MOdelH, AdelH and GAdelH (delH) 
envelopes were phenotypically similar. Indeed the E-RBD fragment, containing the 
MoMLV ecotropic receptor binding domain, could rescue the infectivity of MOdelH 
retroviruses by up to 1,000 fold (Fig. 4A). Similar results were obtained for GAdelH 

10 retrovirus, whose fusogenicity could be efficiently rescued by soluble envelope fragment 
encompassing the GALV envelope glycoprotein receptor-binding domain. This result 
indicated that fusion activation by soluble envelope fragments was not particular to AdelH 
envelopes and suggested that the MLV SU amincntenninus may convey a signal common to 
other type C mammalian retrovirus which may be required to achieve early post-binding 

15 events. 

To address this hypothesis the delH mutations, H5del, H8del, were introduced in the A- 
RBD or in the E-RBD fragments, respectively. The resultant A-RBDdelH and E-RBDdelH 
could bind their respective cell surface receptors as efficiently as the parental envelope 
fragments (Fig. 4B). They could also decrease the infectivity of retroviruses bearing wild- 

20 type amphotropic or ecotropic glycoproteins, respectively* by up to 10-fold (Fig. 4A). This 
effect was most likely a consequence of partial receptor blocking. However in contrast to A- 
RBD or to E-RBD, neither the A-RBDdelH nor the E-RBDdelH fragments could rescue the 
infectivity of AdelH or MOdelH retroviruses, respectively (Fig. 4A). These data therefore 
demonstrated that the integrity of the amino-terminal ends of either RBD fragments was 

25 absolutely required to activate postbinding functions and suggested that an essential 
determinant of cell membrane activation may reside at the amino-terininal end of the MLV 
RBD. Thus the so-called "receptor binding domain 11 located in the amino-terminal half of 
the MLV SU may in fact be composed of two different entities, one involved in specific 
receptor binding and a second, likely to be non functional in AdelH or MOdelH envelopes, 

30 involved in transmission of a signal that may activate cell entry upon receptor binding. 

Therefore to test whether the binding function and the activating function of the MLV 
receptor binding domain could be uncoupled, cells "infected" with AdelH or MOdelH 
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retroviruses were cross-incubated with either E-RBD or A-RBD polypeptides, respectively. 
Interestingly the infectivity of MOdelH retroviruses could be activated by either A-RBD or 
E-RBD fragments (Fig. 5A). This cross-activation was reciprocal since either fragments 
could activate, with similar efficacies, the infectivity of AdelH retroviruses (Fig. 5A). The 
5 cross-reactivity of either RBDs was dependent on the presence of both mCAT-1 and PiT-2 
receptors on the cell surface since neither PiT-2-transfected CHO cells nor mCAT-1- 
transfected CHO cells could be infected when E-RBD was used to activate the infectivity of 
AdelH retroviruses (Fig. 5B) and vice-versa (data not shown). The specificity of activation 
of AdelH/MOdelH retroviruses by ecotropic envelope fragments was further demonstrated 

10 by using the MOD84K-ST envelope fragment, harboring the D84K point mutation that 
inactivates MoMLV envelope binding (MacKrell et aL, 1996). This fragment could activate 
neither AdelH nor MOdelH retroviruses despite the presence of both amphotropic and 
ecotropic receptors on target cells (Fig. 5A). Cross-activation of AdelH retroviruses could 
also be obtained by using soluble envelope glycoproteins from Gibbon Ape leukemia virus 

15 (data not shown), thus indicating that infection by delH retroviruses could be activated by a 
pathway common to type C mammalian retrovirus. 

Table 1. Activation of AdelH retroviruses by A-RBD 



20 Cells Pre-binding a binding and infection b Titre c 



XC - lacZ(AdelH) SxlO 1 

XC - lacZ(AdelH) d 4- A-RBD d 3.1xl0 5 

XC-A-RBD e - lacZ(AdelH) 9xl0 4 

25 XC A-RBD lacZ(AdelH) 1.1x10 s 

XC lacZ(AdelH) A-RBD 2.7xl0 5 



a: Target cells were pre-incubated for one hi at 4°C with conditioned supernatants 
containing A-RBD (used undiluted) or with lacZ retroviral vectors carrying AdelH envelope 
30 glycoproteins, as indicated. 

b: Target cells were incubated for 3 hrs at 37°C. 

c: Titres as lacZ i.u./ml determined 48 hrs post-infection. 

d: Both A-RBD (used undiluted) and AdelH retroviruses were added at the same time. 
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e: XC cells constitutively expressing A-RBD polypeptides. 

Example 2: Control of syncytia formation by RBD polypeptides 

5 Ideally, the transfer of therapeutic genes into tumour cells should lead both to direct 

local cell destruction and to activation of anti-tumour immunity to clear tumour deposits to 
which the genes cannot be delivered. The most commonly used genes for the control of 
local tumour growth have been the suicide gene/prodrug systems such as Herpes Simplex 
Virus thymidine kinase (HSVtk)/Ganciclovir (GCV) or cytosine deaminase (CD)/5FC 

10 systems. Importantly, these systems not only kill gene-transduced target cells but also close 
neighbours via local bystander effects. These effects depend upon cell-cell communication 
or on the diffusion of toxic metabolites, but generally act only over short ranges. 
Nonetheless, the existence of such bystander effects are crucial to compensate for the 
relatively poor efficiencies of gene transfer that are currently possible. However, results 

15 from early clinical trials have demonstrated that both better delivery systems and more 
effective genes are clearly still required to improve the efficiency of both local tumour 
control and of anti tumour immune responses. Hence, therapeutic genes with greater local 
killing activity would represent a significant advance for gene transfer strategies for cancer. 
It has been recently demonstrated that the well-known ability of several viral 

20 fiisogenic membrane glycoproteins (FMG) to fuse target cells could be exploited 
therapeutically to kill tumour cells. These previous results introduced the FMG as a novel 
class of therapeutic genes which are more potent than existing suicide gene systems in 
killing tumour cells and do so by mechanisms which are highly immimostimulatory. Table 2 
shows the results of four FMGs derived from the non fiisogenic ecotropic or amphotropic 

25 murine leukemia virus glycoproteins, whose fusogenicity were increased by deleting the 
p2R peptide (naturally removed during retrovirus egress) for the ARless and MoRless FMG 
and also by modifying the proline-riche region (Lavillette et al., 1998) for the C2Rless and 
PROMORless FMGs. 

The four resulting FMGs, Arless, C2Rless and PROMORless and MORless induced 
30 syncytia formation in most of the cell types that express amphotropic receptors and/or 
ecotropic receptors. However for selective eliminations of cells as required in cancer gene 
therapy protocols, the design of a mechanism that can control the formation of syncyia is 
required. We therefore introduced in all four FMGs the H5del or the H8del mutation (delH 
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mutation) which removes the 5th or the 8 th residue of the arnphotropic or the ecotropic SU, 
respectively, an histidine belonging to the PHQV conserved motif. This mutation could 
inactivate an early event of the fusion process after receptor binding (Example 1 and 
Lavillette et al., 2000). As previously demonstrated the virus-cell fusion activity of delH 
5 MLV envelopes bearing the delH mutation can be efficiently rescued by using polypeptides 
encompassing the first 213 amino-acids of arnphotropic MLV SU (A-RBD) or the first 236 
amino-acids of ecotropic MoMLV SU (E-RBD) (example 1 and Lavillette et al., 2000). 
Consistently with Example 1, the data in Table 2 demonstrated that the cell-cell fusion 
activity of the four FMGs engineered to harbor the delH mutation, AdelRless, 
10 C2delHJRless, and PROMOdelHRless and MOdelHRless, could be rescued by providing 
either the A-RBD or E-RBD polypeptides thus leading to syncytia formation and elimination 
of target cells that express the relevant viral receptors. Interestingly formation of syncytia 
by delH FMGs could be achieved by using an heterologous polypeptide (E-RBD) that 
interacted with a receptor different of that recognised by the delH FMGs. 



15 



20 



30 



Table 2. Modulation of cell-cell fusion by soluble envelope fragments 

Envelope RBD Syncytia ^ delH envelope RBD Syncytia 



ARless - ++ AdelHRless 

25 C2Rless - + + + C2delHRless 

PROMORless - + + + + PROMOdelHRless 

MORless - + + + + MOdelHRless 



A-RBD 


+/- 


E-RBD 


+ 


A-RBD 


+/- 


E-RBD 


+ + 




+/- 


A-RBD 


+ + 


E-RBD 


+ + + + 


A-RBD 


+ + + + 


E-RBD 


+ + + + 



35 fusion indexes (calculated according to Lavillette et al., 1998) -: less than 100, +/-: 100-304, 
+ : 300-1000, + + : 1000-10 000, + + + : 10 000-15 000, + + + + :> 15000 



Of note, similar results of fusion activation with RBD polypeptides were obtained 
40 with the GALVdelHRless envelope glycoprotein, indicating that the delH mutation of the 
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potent GALVRless FMG (Bateman et al., 2000) can be controlled by adding a soluble Env- 
derived polypeptide (Lavillette and Cosset, unpublished results). These data indicate that it 
is possible to exploit a well-known phenomenon - the fusion of virally infected cells with 
uninfected cells as a novel, potent cytotoxic therapy using gene transfer. There are several 
5 options to exploit the use of delH FMGs in cell elimination stategies. Firstly viral vectors 
such as adenovirus vectors can be used to transduce both the delH FMG and the gene 
encoding the activating RBD polypeptide in cancer cells in vivo. Secondly cells transduced 
in vitro or ex vivo by a vector expressing a delH fusion-defective FMG can be induced to 
form syncytia after inoculation of an RBD presented as a protein preparation. In addition as 
10 well as using direct gene transfer, delH FMG and RBD can be effectively delivered through 
cell-mediated mechanisms, which will also be useful as an option for in vivo delivery of 
FMG into tumours for both local tumour eradication as well as enhancement of anti tumour 
vaccination strategies. 

15 Example 3: Addition of ligands on RBD polypeptides 

To further control the fusion activation of FMGs by RBD polypeptides, a ligand was 
inserted at the amino-terminal end of the RBD. The anti-MHC-I 34.1 scFv was chosen as 
we previously demonstrated that it could redirect the host-range of retroviral vectors 

20 carrying 34. 1-ecotropic MLV chimeric envelope glycoproteins and allow specific infection 
of human cells (Marin et al. 1996). Therefore the 34.1 scFV was fused to A-RBD or to E- 
RBD and preparation of 34.1- A-RBD or 34.1-E-RBD polypeptides were obtained. 
Retroviral vectors carrying wild-type envelope glycoproteins, A, or AdelH fusion-defective 
amphotropic envelope glycoproteins were used to infect target cells in the presence of A- 

25 RBD, E-RBD, 34.1-E-RBD or 34.1-A-RBD polypeptides (Table 3). While either 
polypeptides could only weakly decrease the infectivity of parental retroviruses, they could 
strongly stimulate the infectivity of AdelH retrovirus. No significant differences could be 
detected when comparing the A-RBD, the E-RBD, the 34.1-E-RBD or the 34.1-A-RBD 
polypeptides in activation of AdelH retrovirus infectivity (Table 3). 

30 
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Table 3. Modulation of virus-cell fusion by ligand-displaying 
envelope fragments 



5 


Envelone 


RED frasment 


Titers 




A 




8.1xl0 6 






A-RBD 


5.6x10 s 






E-RBD 


8xl0 6 






34 1 -A-RBD 


1 35xl0 6 


10 




34.1-E-RBD 


3xl0 6 




AdelH 




2X10 1 






A-RBD 


8.5xl0 4 






E-RBD 


8x10 s 


15 




34.1-A-RBD 


7.3xl0 4 






34.1-E-RBD 


7x10 s 



These data therefore suggest that it is possible to control the fusion activity of FMGs by 
20 adding specific ligands on the activating polypeptides. Therefore the design of a specific 
system to eliminate unwanted cells may consist in a delH FMG associated to an RBD 
polypeptide that displays a ligand that can specifically recognise a receptor expressed on the 
target cells. Table 4 shows that 34.1-A-RBD and 34.1-E-RBD could activate the cell-cell 
fusion activity of the PROMOdelHRless FMG with an efficiency similar to that of AtRBD 
25 or E-RBD. 

Altogether these results indicate that since the 34.1-E-RBD or 34.1-A-RBD 
polypeptides could cross-activate the fusogenicity of AdelH, AdelHRless, or 
PROMOdelHRless envelope glycoproteins on cells that express the targeted antigen (the 
30 MHC class I molecules) but no receptor corresponding to either the chimeric RBD or the 
delH retrovirus (Tables 3 & 4), efficient retargeted infection or cell-type specific syncytia 
formation could be obtained, 

35 
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Table 4. Modulation of cell-cell fusion by 
iigand-displaying envelope fragments 



Envelope 


RBD 


Syncytia 


A 


- 


18 


ARless 


- 


4464 


ARlessdelH 




14 




RBD 


1014 


PROMORless 




14796 


PROMORlessdelH 




304 




A-RBD 


18920 




E-RBD 


17000 




34.1-A-RBD 


19000 




34.1-E-RBD 


20000 



Example 4: Activation of Env fusogenicity by RBD and C/TM domains 
20 interaction 

The H5del mutation was introduced into the CMO, the TMO and the CTMMO 
MLV amphotropic envelope glycoprotein (Lavillette et al., in preparation), derived from the 
MLV 4070A envelope glycoprotein by substitution of the SU carboxy-terminal domain (C 

25 domain) and/or TM ectodomain with those derived from the MLV ecotropic envelope 
glycoprotein (Figure 29A). Recent evidence from the literature suggest that the C domain of 
MLV envelope glycoproteins is involved in activation and/or control of the fusion 
machinery contained in the TM subunit of the envelope glycoprotein (Nussbaum et al., 
1993; Pinter et al., 1997). Retroviruses carrying either the parental or the delH-mutated 

30 CMOdelH, TMOdelH and CTMMOdelH chimeric envelope glycoproteins were prepared 
and their infectivity was assayed in the presence or the absence of A-RBD or E-RBD 
soluble polypeptides (Table 5). While the infectivity of AdelH control retroviruses could 
easily be rescued when either A-RBD or E-RBD polypeptides were present in the infection 
mixture (Lavillette et al., 2000), the infectivity of CMOdelH, TMMOdelH and 

35 CTMMOdelH retroviruses could not be rescued with the A-RBD polypeptide although it 
was efficiently rescued with the E-RBD polypeptide (Table 5). These results indicated that 
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the fusogenicity of the CMOdelH, TMMOdelH and CTMMOdelH envelope glycoproteins 
could only be activated by soluble RBD polypeptides derived from the same envelope 
glycoprotein from which the C and/or TM domain were inserted in the delH chimera. 
Further evidence for the requirement of a compatibility between the RBD and the C and/or 
5 TM domains were provided by the finding that the CTMMO retroviruses displayed a 
significantly slower kinetic of infection comparatively to that of retroviruses carrying wild- 
type MLV amphotropic envelope glycoproteins (Figure 29B). In the absence of exogenous 
RBD in the infection mixture, the replication retroviruses carrying the CTMMO envelope 
glycoproteins was delayed, most likely owing to the unability of the MLV 4070A-derived 

10 RBD of the latter to activate their MoMLV-derived C and/or TM domains. Yet the addition 
in the infection mixture of exogenous RBD polypeptides derived from the ecotropic 
MoMLV envelope glycoprotein (E-RBD), matching the C and/or TM domains of the 
CTMMO chimera could restore the infection kinetic of CTMMO retroviruses to a level 
similar if not identical to that of wild-type amphotropic retroviruses (Figure 29B). In 

15 contrast the kinetic of infection of retroviruses carrying wild-type amphotropic envelope 
glycoproteins was not increased in the presence of the E-RBD polypeptide. Additionally, 
infectivity with retroviruses carrying either wild-type amphotropic or CTMMO envelope 
glycoproteins was strongly decreased or inhibited, respectively, when infection was 
performed in the presence of A-RBD (Figure 29B). This inhibitory effect was most likely 

20 due to A-RBD-mediated partial blocking of amphotropic receptors in the case of wild-type 
amphotropic envelope glycoproteins and both receptor interference and incompatibility 
between this MLV amphotropic Env-derived fragment and the ecotropic C domain in the 
case of CTMMO chimera. 

25 Table 5 

Infectivity in the presence of : 



no RBD 


A-RBD 


E-RBD 


10 6 


10 4 


10 6 


10 1 


10 5 


10 5 


10 6 


10 2 


10 6 


10° 


10° 


10 s 


10 6 


10 3 


10 6 


10° 


10 3 


10 5 
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10 6 


10 1 


10 6 


10° 


10° 


10 s 


10 6 


10 6 


10 5 



Example 5: Retargeted gene delivery using RBD fragments. 

We have therefore demonstrated that the RBD of a retroviral envelope glycoprotein 
5 is not only a domain responsible of attaching the viral particle to a cell surface receptor but 
is also a domain that is responsible, upon receptor binding, of fusion activation most likely 
via an interaction with the carboxy-terminal domain of the SU Env subunit (C domain). The 
RBD determinant responsible of this activation appears to be located at its ammo-terminal 
end and can be disrupted by the delH mutation. Thus receptor binding of RBD most 

10 probably promotes a conformational change of this ammo-terminal end which, in turn, is 
rendered competent for inducing further conformational changes in the Env C domain that 
control the fusion machinery located in the TM Env subunit. We surmised that the 
replacement of the RBD of a retroviral envelope glycoprotein by a cell type specific ligand 
may redirect the attachment of viral particles carrying such targeting envelope 

15 glycoproteins, yet specific attachment of these virions would not promote fusion activation 
due to the absence in the novel ligand of a molecular device responsible of lusion activation. 
Such an activation, leading to infection, may therefore be restored by providing in trans a 
soluble RBD in the infection mixture. 

Data in Table 6 demonstrate that retroviral vectors carrying a chimeric envelope 

20 glycoprotein in which the wild-type RBD is replaced by a single-chain antibody against 
human class I MHC molecules cannot infect human cells although they can efficiently bind 
to class 1 MHC molecules expressed at the surface of the target cells. The addition in trans 
of RBD polypeptides during infection can promote the events necessary to allow infection of 
the target cells, demonstrating that retargeted binding of chimeric viral particles can give 

25 rise to productive infection and/or gene delivery when the fusion domains of the targeting 
envelope are activated by an exogenous polypeptide. 

Table 6. RBD-mediated activation of the fusion activity of retargeted envelope 
30 glycoproteins 
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Enva - + RBDb + RBD & competitor 0 

A ■ ~ 106 105 105 

5 34.1-A <101 105 102 

a: LacZ retroviral vectors were generated with the indicated envelope glycoproteins. A, 
wild-type amphotropic Env; 34.1-A, a chimeric envelope glycoprotein derived from the 
MLV amphotropic Env in which the receptor binding domain was replaced by a single 
10 chain antibody (scFv) against human class I MHC molecule (Marin et al, 1995). The 
vectors were used to infect class I positive human cells. Titers as lacZ i.u./ml. 
b: A preparation of A-RBD polypeptide was added during infection. Similar results were 
obtained when the A-RBD polypeptide was added before or after infection with the 34.1-A 
retroviruses. 

15 c: Anti-MHC-I IgGs, corresponding to the Mab which was used to derive the anti-MHC-I 
scFv inserted into the 34.1-A chimera, were used as competitor in this infection assay. The 
anti-MHC-I IgGs were used to block the targeted antigens before infection with the 34.1-A 
retroviruses. 

20 
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CLAIMS 

5 1. Use of at least one activating polypeptide comprising an amino acid sequence 

corresponding to : 

- the amino acid sequence of the surface subunit (SU) of an envelope glycoprotein of a 
first virus, or of a first retrovirus, or 

- an amino acid sequence derived from said surface subunit, or 

10 - a fragment of an amino acid sequence of said surface subunit, encompassing the 

receptor binding domain (RBD) and the amino terminal part of the surface subunit, 

for the rescue of the fusion property of a fusion defective envelope glycoprotein, 
which has the property of attachment to target cells and is inactive with respect to the fusion 
with said target cells, said fusion defective envelope glycoprotein belonging to a second 

15 virus or a second retrovirus of the same type as or of a different type from the above 
designated "first virus or first retrovirus". 

2. Use of an activating polypeptide according to claim 1, wherein the activating 
polypeptide recognizes substantially the same receptors of the target cells as the fusion 

20 defective envelope glycoprotein or wherein the activating polypeptide recognizes receptors 
of the target cells which are different from the ones recognized by the fusion defective 
envelope glycoprotein. 

3. Use according to any one of the claims 1 or 2, wherein the activating 
25 polypeptide and/or the fusion defective envelope glycoprotein is (are) linked to a ligand, 

which is specific with respect to some receptors of the target cells, the ligand being 
preferably linked to the activating polypeptide. 

4. Use according to claim 3, wherein the ligand is chosen among molecules liable 
30 to recognize receptors of target cells, for instance receptors of tumoral cells, such as EGF, 

FGF, VEGF. 
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5. Use according to any one of claims 1 to 4, wherein the activating polypeptide is 
under the form of a product as such, or is expressed by a host, such as a virus or a cell, or 
is expressed, in particular by the target cells, transformed by nucleotide sequences coding 
for said activating polypeptide. 

5 

6. Use according to any one of claims 1 to 5, wherein the fusion defective envelope 
glycoprotein is expressed by a host, such as a virus incorporating in its envelope said fusion 
defective envelope glycoprotein, or such as a cell, transformed by nucleotide sequences 
coding for said fusion defective envelope glycoprotein. 

10 

7. Use according to any one of claims 1 to 6, wherein the activating polypeptide 
corresponds to a fragment of about 200 to about 250 amino acids of said surface subunit 
comprising the receptor binding domain (RBD) and comprising the first amino acids of the 
amino terminal part of the surface subunit, the number of said first aminoacids varying from 

15 about 10 to about 15, in particular comprising the RBD and the PHQ peptide, in particular 
the fragments corresponding to amino acid sequences derived from type C mammalian 
retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
20 . ecotropic 

. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV- 
25 B, FELV-C 

in particular wherein the activating polypeptide corresponds to one of the following 
amino acid sequences : 
A-RBD 
E-RBD 
30 GALV-RBD 
34-1-A-KBD 
34-1-E-RBD 
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34-1-GALV-RBD 
EGF-A-RBD 
EGF-E-RBD 
EGF-GALV-RBD 

5 

8. Use according to any one of claims 1 to 7, wherein the fusion defective envelope 
glycoprotein corresponds to the amino acid sequence of a mutated form of an envelope 
glycoprotein of a virus or a retrovirus which, in its non mutated form, presents attachment 
and fusion properties with respect to target cells, said fusion defective envelope 
10 glycoprotein being advantagously derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 

15 . polytropic 

. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV- 
B, FELV-C and comprising : 

20 * one or several mutations in the amino terminal part of its surface subunit, 

particulary in the first amino acids of the amino terminal part of the surface subunit, the 
number of said first aminoacids varying from about 10 to about 15, said mutations 
inactivating the fusion property without substantially affecting the attachment property, in 
particular comprises one or several mutations within the PHQ aniino-terminal sequence, 

25 wherein in particular the fusion defective envelope glycoprotein corresponds to one of the 
following amino acid sequences : 

- MO R less del H 

- MO del H 

- A R less del H 
30 -AdelH 

- PROMO R less del H 

- PROMOdelH 
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- GALV R less del H 

- GALVdelH 

- C2 R less del H 

- C2delH 

5 - C MOdelH 

- TM MOdelH 

- C TM MOdelh 

* or the replacement of the ammo-terminal domain of its surface subunit (RBD) by a 
ligand such as a growth factor, or a cytokine or a single chain antibody, the extent of said 
mutations inactivating the fusion property and the attachement property to the normal viral 
receptor, in particular wherein the fusion defective envelope glycoprotein corresponds to the 
following amino-acid sequence : 

-34.1-A 

* or the replacement of its SU carboxy-terminal domain and / or its TM ectodomain, 
and preferably its SU carboxy-terminal domain by the corresponding SU carboxy-terminal 
domain and / or the TM ectodomain of another "second" glycoprotein derived from type C 
mammalian retroviruses, with the proviso that said fusion defective glycoprotein and said 
second glycoprotein are not derived from the same type C mammalian retroviruses, in 
particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino-acid sequences : 

-CMO 

- TM MO 

- C TM MO 

9. Use of at least one activating polypeptide and of at least one fusion defective 
envelope glycoprotein for the preparation of a drug useful for the treatment of cancer and 
infectious diseases. 

30 

10. Products containing an activating polypeptide and a fusion defective envelope 
glycoprotein as defined in any one of claims 1 to 9, as a combined preparation for 

.- 
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simultaneous, separate, or sequential use for the rescue of the fusion defective property of 
said fusion defective envelope glycoprotein. 

11. Products containing an activating polypeptide and a fusion defective envelope 
glycoprotein as defined in any one of claims 1 to 9, as a combined preparation for 

5 simultaneous, separate or sequential use for the treatment of cancer pathologies. 

12. Activating polypeptides corresponding to a fragment of about 200 to about 250 
amino acids of said surface subunit comprising the RED and the PHQ peptide, in particular 
the fragments corresponding to amino acid sequences derived from type C mammalian 

10 retroviruses such as: 

- Murine leukemia virus such as : 
. amphotropic 
. ecotropic 
. xenotropic 

15 . poly tropic 

• 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 
FELV-B, FELV-C 

in particular wherein the activating polypeptide corresponds to one of the following 
20 amino acid sequences : 

A-RBD 

E-RBD 

GALV-RBD 

34-1-A-RBD 
25 34-1-E-RBD 

34-1-GALV-RBD 

EGF-A-RBD 

EGF-E-RBD 

EGF-GALV-RBD 



30 



13. Fusion defective envelope glycoprotein to the amino sequence of an envelope 
glycoprotein of a virus or a retrovirus which in its native state presents attachment and 
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fusion properties with respect to target cells, and comprises one or several mutations in the 
amino terminal part of its surface subunit, inactivating the fusion properly without 
substantially affecting the attachment property, in particular comprises one or several 
mutations within the PHQ amino sequence of the amino terminal part of its surface subunit, 
5 wherein in particular the fusion defective envelope glycoprotein is derived from type C 
mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 

10 . xenotropic 

. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B 
or FELV-C 

15 in particular wherein the fusion defective envelope glycoprotein corresponds to one of 

the following amino acid sequences : 

- AdelH 

- PROMOdelH 

- GALVdelH 
20 - C2delH 

- A R less del H 

- PROMO R less del H 

- GALV R less del H 

- C2 R less del H 
25 - C MO 

- TM MO 

- C TM MO 

- C MOdelH 

- TM MOdelH 
30 - C TM MOdelh 

-34.1-A 



45 



BNSDOCID: <WO 0071578A3JA> 



WO 00/071578 PCT/EP00/04534 



with the proviso that they are different from the ecotropic MLV sequence, wherein the 
amino acids in position 2 to 8 are either substituted or deleted. 



nlMRnrmirv -w<-> nmiR»n 
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